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— ^ — ^ I CHAPTlERII 



IPEA/yS. 



PCX 

DEMAND 



For International Prelinunaor Examining Authority use only 



iHrntificationoflPEA 



BOX NO. I IDENTIFICATION OF THE INTERNATIONAL APPLICATION 



International application No. 
PCT/US03/i7725 



HotA nf recei pt of DEMAND 

Applicant's or gent's file reference 

(Earliest) Priority date {day/ntonth/year) 
04 June 2002 (04.06.02) 



International filing date (day/monthfyear) 
04 June 2003 (04.06.03) 
Title of invention 

K^ODS n,>aKQS».0 ^ T«BAT»<G DIABETES AND mS»UN RBSg^A^|CE 



BOX No. II APPLICANT(S) 



METABOLEX, INC. 
3876 Bay Center Place 
Hayward, Galifomia 94545 
United States of America 



Telephone No.: 
510.293.8800 



Facsimile No.: 
510.293.9090 



Teleprinter No.: 



Applicant's registration No. with the Office 



State {that is, country) of nationality: 
US 



State (that is. country) of residence; 
US 



Nameand address: {Fan^ynan^Joiio^by^^ foraHaie^Moi^^^^^ 



ALLAN, Bernard 

940 Guerrero Street 

San Francisco, California 94110 

United States of America 



State {fitoi is, country) of nationality: 
IE 



State {that is, country) of residence: 
US 



Name 



The address must inciude postal code and nam of country.) 



GRECjOIRE, Francine 
1044 Carol Lane 
Ufayette, California 94549 
United States of America 




^ Further applicants are indicated on a co ntinuation sheet. 

J^^ii^^Cmmx (continuation sheet)(MarchiOOl; .epiuUJanuaij im 



See Notes to the demand form 



Sheet No. 2 



International application No. 
PCT/US03/17725 



CoDtinuation of Box No. i^APPLICANT(S) 



If none of the following sub-boxes 



is used, this sheet should not be included in the demand. 



necddressnmtindudepostalcodeaiuinmerfcountn^) 



State {that is. country) of residence: 
US 



State {that is, country) of residence: 




The address must include postal code and name of county.) 



State (that is, country) of residence: 



The address must include postal code and name of country.) 



ion. The address must include postal code and name of country.) 



Vie address must tndude postal code and name of country.) 



State {that is. country) of nationality: 



Box No, in AGENT OF 



Sheet No. 3 

kMON REPRESENTATIVE; OR ADDRESS Fj 



International application No. 
PCT/US03/17725 

IRRESPONDENCE 



m following person is agent □ common representative 

and El hasbeenappointedearlierandrepresentstheapplicant(s)als^ 

□ is hereby appointed and any earlier appointment of (an) agent(s)/^^^ 

□ ishereby appointed, specincallyfor the procedure before the I^^^^^^^ 
the aeentCsVcommon representative appointed earhcr. 



Name and address: (Wn..e/o/W6,gfv«^^^ 

ne address must include postal code and name of country.) 

HINSCH, Matthew, E.. ^„ r r t n 

TOWNSEND AND TOWNSEND AND CREW LLP 
Two Etnbarcadero Center, 8th Floor 
San Francisco, California 941 1 1-3834 
United States of America 



Telephone No.: 
415.576.0200 



Facsimile No,: 
415,576,0300 



Teleprinter No.: 



Agent's registration No, with the Office 
47,651 



n Addressforcorrespondence: Mark this check:^here no agen^ 

^ ^Z J>Z is used instead to indicate a special address to which correspondence should be sent. 



Box No. IV BASIS FOR INTERNATIONAL PRELIMINARY EXAMINATION 



Statement concerning amendments:* e*..^ nn thP basis of 

I . Ue applicant wishes the international preliminary examination to start on the basis of. 

^ the international application as originally filed 
the description □ as originally filed 

^ as amended under Article 34 

the claims El as originally filed 

□ as amended under Article 19 (together with any accompanying statement) 

Q as amended under Article 34 

the drawings S as originally filed 

Q as amended under Article 34 

2 □Ueapplican.wishesanyamendmenttoAedain.under Article .9.obeconsW^^^^ 

3 □ Ue applicant wishes d,e start of the international .nelinuna^ — ll^^^^^^^^^ 

T^maZmarkedonlywherethetiMtunderArticle^ 

or the international preliminary examination report, as so amended. 




excluding the following Stotes which the applicant wishes not.to elect 



Form PCr/IPEA/401 (continuation sheet) (Maidi 2001; reprint January 2003) 



See Notes to Ihe demand farm 



Sheet No. 4 



International application No. 
PCT/US03/17725 




n^e demand is accompanied by the following elements, in the language referred to in 
Box No. IV, for the purposes of international preliminary exammation: 

translation of international application 

2. amendments under Article 34 

3 copy (or, where required, translation) of 
amendments under Article 19 



4 sheets 



4. copy (or. where required, translation) of 
statement under Article 19 

5. letter 

6. oiher {specify) 



sheets 

sheets 

' sheets 



1 sheet 



sheets 



For International Preliminary 
Examining Authority use only 

received not received 



□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 



-n^c demand is also accompanied by the item (s) marked below; 
I.. S fee calculation sheet 

2. Q original separate signed power of attorney 

3. D original general power of attorney; 

i 

4. D copy ofgencral power of attorney; 
reference number, if any: 



5. □ statement explaining lack of signature 

6. S sequence listing in computer readable form 

7. □ tables in computer readable form related to 

sequence listings 

8 other ispecijy) Transmittal Utter; Article 34 ^^^^f 

with four (4) substitute specification pages 4, 22, 2i ana 
42; Twenty (20) pages of Sequence Listing, Statement 
and Diskette; Postcard 



— . OF APPLICANT, AGENT OR COMMON REPRESENTATIVE 




Matthew E.Hinsch 

TOWNSEND AND TOWNSEND AND CREW LLP 
USPTORcg.Na: 47,651 

Applicants* Agent ' 



For International Preliminary Examining Authority use only 



Date of actual receipt of DEMAND: 



2. Adjusted date of receipt of demand due 
to CORRECTIONS under Rule 60. 1(b): 



n ITiC date of receipt of the demand is AFTER the expiration of 19 months 
from the priority date and item 4 or 5, below, does not apply. 



n The applicant has been 
informed accordingly. 



4 □ The date of receipt of the demand is WITHIN 
Rule 80.5. 



5. □ ^^^^..^^'^^'"^"^"^'•""^'''-'"^'"^'"^'"'^ 
is EXCUSED pursuant to Rule 82. ^ , 



For International Bureau use only 




Demand received from IPE A on: 
Form PCT/IPEA/401 (continuation sheet) (March 2001; repnnt January 2003) 



See Notes to the demand form 



CHAPTER IlH 



PCX 

FEE CALCULATION SHE^ 



Annex to the Demand 

For International Preliminaiy Examining Authority use only 



international 

Ipplir^tionNo. PCT/US03/I7725 




Applicant's or agent's 

fir.reference 16325-I39PC 


Date stamp of the IPEA 


Applicant 

METABOLEX, INC. etai 





CALCULATION OF PRESCRIBED FEES 



' I . Preliminary examination fee 

2. Handling fee (Applicants from certain States are 
entitled to a reduction of 75% of the handling fee. 
mere the applicant is (or all applicants are) so 
entitled, the amount to be entered at H is 25% of the 
handlingfee.), 

3, Total of prescribed fees 

Add the amounts entered at P and H 

and enter total in the TOTAL box 



490.00 



172.00 



H 





662,00 






TOTAL 





MODE OF PAYMENT 

^ authorization to charge deposit 
account with the IPEA (see below) 

Q cheque 

pi postal money order 

Q bank draft 



ni cash 

[~\ revenue stamps 

f~] coupons 

□ other (specify): 



AUTHORIZATION TO CHARGE CREDIT) DEPOSIT ACCOUNT 
(This mode of payment may not be available at all ll'tAsy 

la isherebyauthorizedtochargethetotalfeesindicatedabovetomydepositaccount 

my deposit account 



The IPEA/ US 



20-1430 



S January 2004 
Xyz!Lt\dav/monih/Year) 



Deposit Accou nt Number - — : . , .^y. 

"Form PCT/IPEA/401 (Annex) (July 1998; repnnt July 1999) 
(60109700 vl) 



vA)Hii 



<^imiat.i«> Matthew k/Hinsch 

— °= See Notes to the fee calculation sheet 



IWl In some onbodtaenB, the detecting step comprises quantifying 
„*NA encoding ^.ypepti-e. U, some embodiments. th^^NA is revere transonbed 

and amplified in a^ymera«<:'»''"»"'''''^ ... 

(171 in some embodiments, the sample is a blood, nrine or tissue sample. 

,181 The present invention also provides isolated nucleic acids that 
Kybridi^e under stringentconditionstoapolynucleotideencodingapolypepUdeha^^^ 
loacidse,uencese,ec.ed.^mthegr„upconsisU„go,SEQa,NO:2,SEQmNO:4. 

SEQ ID N0:6. SEQ ID N0:8. SEQ ID NO:10. SEQ ID N0:12 or SEQ ID N0:14. 

,191 In some embodiments, the nucleic acid is selected fom the group 
consisting of SEQ ID N0:1. SEQ m N0:3. SEQ ^ N0:5. SEQ ID N0:7. SEQ ID N0:9. 

SE0IDN0:11 orSEQIDNO;13 

,201 The present inv«,tion also provides an expression cassette compnsmg 

aheterologous promoter operablyUnkedtoapolynucleotidethathybridizes under strtugen. 

conditionstoanucleicacidencodingapolypeptidehavinganaminoa.^^^^^ 
,i„mthegroupconsistingotSEQIDNO:2.SEQIDNO:4.SEQmNO:6.SEQmN0.8, 

SEQlDNO;10.SEQIDNO:12orSEQIDNO:14. , , ' 

,21, The present invention also provides host cells transtected wtth a 
polynucleotidethathybridizesunderstdngentcondiUonstoanucleicacidencodi^a^ 

polypeptide having an amino acid sequence selected fh,m the ^P — "'''^^^ 
NO^. SEQ ID N0:4. SEQ m N0:6. SEQ ID N0:8. SEQ m HO:10. SEQ U) Na.l2 or SEQ 
n,N0:14. Insomeembodiments.thehos.cellisahun,ancell. In some embod.men.^ the 

host cell is a bacterium. 

,221 The present invention also provides isolated polypeptides compnsmg 
3„aminoacidse,uenc..Ieast70%idenHcaltoSEQmNO:2.SEQIDNO:4.SEQID 

NO-6 SBQIDNO:8.SEQIDNOaO.SEQIDNO:12orSEQIDNO:14. tasome 

NO.e..Msy .■.„i.<!F.OIDNO-2SEQIDNO:4.SEQIDNO:6.SEQID 
embodiinents. the polypeptide IS SEQ ID NU.A acv 

N0:8. SEQ ID NO:10. SEQ ID N0:I2 or SEQ ID N0:14. 

DEFINITIONS ' 
, ,231 •Insulinsensitivity-referstotheabilityofacellortissuetorespondto 

i^lin. Responses include. e.g.glucoseup.alceofacel. or tissneinrespot^etoin^^ 

stimulation, sensitivity canbedetermined a. anorga„isma,,.iss.eorc.n.aarleveI For 

example. bloodorurineglucoselevelsfollo«ingaglucosetolerance,estare.nd.cat.veof 

insulin sensitivity. Other methodsofmeasuringinsuUnsensitivi^, include. e.g.,m.as,™g 



S«.danl Proteto S.p«.fl.. T«bol,.« For Purifying Pr..eias 



I Fractionation 

,^;;;;:~:^s.ep. and if *e pro.Kx«.e is complex an m ma. 

LmeceUculure media) ftomftcrcoombinan,pro«inofin««a.. The pref««d s^. .s 

— snifa.e.ec.pUa.esp»^ 
r»tofwa.erinU,ep„.einmix.ure. Pro.ei.d,enp«oipi.»on.hebas,som^^^^ 

lbi.i.yThemorei,ydrophob,capro,ei„is,.hemore,ike,y,.,s.oprec,p,«ea..ow« 

:!l:.s„.ra.ec„ncen.^>ons...ypica,p™.oco.is.oaddsa.„.edamm^^^ 
rp».einso,u«onso.ha..he«s„.U„.ammo„iumsu,fa,eco„cen,,a.,o„«b^nv^20.^ 

complex protein mixtures. 

0 <;i7P. Differential Filtration 

,83, Basedo„acalcu.a,edmoleouIarwei^..aproteinotgr.a.erandlesser 
,..anbei»,a.ed„si„gu,«afa.,aao„U.oughme.b«nesofdiffere„.poresiz»(for 
example A»iconorMiliiporemembranes).Asaf„3.s.ep,*eprotemm.x»re s 

, "rU.. Tbe — p».ein»i«pa.«nougb.bememb™em.o.e 
r„tta.e. Thefiliralecanthenbeci^omatographedasdescribedbelow. 

,841 Tl>epro.einsofin.eres.oanalsobesepara.edih>motopro«.nson 

U,e basis of .heir size, ne. surface charge, hydrophobiciiy and aMy for iigand,. 
,„ :l:.an«bodie.raisedagains.pro.einsoaubecouiug..»co„^ 

■r,^A All of these methods are well known in the art. 
^""■^•■^^rl— chroma.ographyusi„gan«b<.esr.sed..varie.of 

^rf„ .ags such as hemaggiuUmn (HA), FLAG, Xprcss. Myc. hexahistidine (SEQ m 
;!^,5)0^is).g.u«hioneS.ransferase(GST)and*e.ikccanbeu.edu.pur,.y 



22 



10 



15 



20 



25 



30 



polypeptides. TheHis tag will also act asachelating agent forcert^^ 
Lthe metalsca^beused to purify His-containingpol^tides.^^^ 

tag is optionally relied by specific proteolytic cleavage. 

(861 It will be apparent to one of skill that chromatographic techniques can 
beperformedatanyscaleandusingequipmentfrommanydifferentmanufacturers(e.g.. 

Pharmacia Biotech). 

,V. DETECTION OF POLYNUCLEOTIDES OF THE INVENTION 

(871 Those ofskill in the art will recognize that detection of expression of 
polynucleotides and polypeptides of the invention has n«ny uses. For example, as discussed 
herein, detection of levels of polynucleotides and polypepHdes of the invendon in a pah». ts 
useful for dia^osing diabetes or a predisposition for at leas, some of the patological effects 

of diabetes. Moreover, detection of gene expression is useM to identify modulators of 
expression of polynucleotides and polypeptides of the invention. 

,88] A variety of methods of speciHc DNA and RNA measurement that use 
nucleic acid hybridization techniques are known to those of skill in the aK isee. Sambmok, 
™) Some methods involve a. el«*ophoretic separation Southern blot for detectmg 
DNAandNorthemblo,fordetec.mgRNA).b«tmeasnrementofDNAandRHAcanalsobe 

oarriJdout in the absenceofelee.rophoreticseparation(..g.,by do. blot), southern blot of 

genomic DNA(eg.,ir.mahuman)canbe used for sereeningfor restriction fagmentl-g* 
polymorphism (RFLP)to detect the presenceof.genetic disorder affectingapolypephdeof 

the invention. . i a 

(891 n.eselectionofanucldcacidhybridizattonfonnattsnotent,cal.A 

variety of nucleic acid hybridization fommts are known to those skilled in the art. For 
example, common formats includesandwich assays and competition or displacement assays. 

Hybridizationtechni,uesarege„erallyde^ribedinHamesandHiggins*c/«c^a<( 
HyM^o,ion. A PraCical Approack. IRL Press (1985); Gall and Pardue, Proo. NaU. Acad. 
Sci as J., 63:378-383 (1969); and John el al. Nature, 223:582-587 (1969). 

(901 Detectiohaf a hybridization complex may require the bindmg of a 
signal-genet^ingoomplextoaduptooftargetandpr^epolynucleotidesornucleicacids. 
Typically.suchbindingoccurs through Ugand and anti-ligand interaction as betweena 

llgand-conJugatedprobeahdananfl-ligandconjnga.edwim.signal. Ute billing of the 
signal gene^ioneomplex is also readilyamenabletoaceelerationsbyexposuretoultras^^ 

energy. 
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10 



15 



,162- SmuUriy.anyhap«nicoran«geniooompoundcanbeusedin 

^to„.i»«rI».«nrecep.o.,— 0810 u,,„,e^^^^^^ pi,„„ & Power. * 

f.n,ilv Ihe inteerin family, the selectin family, and the hke, see. e.g.. 6 

family, the mtegnni y, 3„<, venoms, viral epitopes, 

AMesU,nMolea.leFac^BootI(im)- S.m.larly,to ^^^^ „^i„e ,he elTeCs 

hormon.s(.g.,opi.t.s,s.er„ids,*.).intrac..liJarr«>eptors(.g.,wh.chm^^^^^^^ 

"s^aiiU-.---'-^"'^"'^— ''T: : ■ 

,ept,des,,d..,.-,s„^n..eio.^^^^^ 
configurations), oligosacchandes. proteins, phospholipids an 

various cell receptors. nnlvesters oolycarbonates, 

,163] Synthetic polymers, such as polyurethanes, polyesters, poly 

..^a.Hyamides,polye.y,eneimin.^»e« 

'-"'^''^tr— ■hdcerss.haspept.des.polye.he.andtheli.ecanaUo 

^eas.ags.andinc,..depo,ypep.idese<,ue„ces,suchaspo.y-ayse<,ue^^fbe«^^^^ 

M,«FOIDN016). Suchflexiblehnkersareknowntothoseol 
^.Sand^OOaminoaoi^CSEQmN^^^^^^^ 

..kin in the art. For example, poly(ethyiene giywii; 

Crs,,no.,H™tsvi«=...abam. these linkers optionally have amide linkages, 
in,„^r^,uinkaBes or heterofunctional linkages. 

^•''^'^ r^Ta^binde^arefixedtosolidsu^ 

' ^^nivHmvatizedorfimctionalizedby 
•I ui» QriiiH «!ubstrates are commonly aenvauzeu ui lui 
methods currently available. Solid substrates 

«f the ciubstrate to a chemical reagent that fixes a cnemica f 

,0 ««1>«*'8^'°"'"'"°"''"'!™' "tLfmetagbinder. For example, groups that are 
O.esurfacea^tisrea.ivew.thapo^nof^'^^h^^^^^ 

s„iU*le for attachment to a longer Cham portion w ._^„fu„ctionalizea 

rr—l^Lhlopolymer 
variety of surfaces, such as glass surfaces. The construe 
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Palo Alto. Cainornia 
Tel 550326-2400 

Der.ver. Colorado 
Tel 303 5714000 
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•atifoinia 



SaaiiSe. VVastiingion 
Igl 206 457-9600 



San Francisco 

Two Embarcadero Center 
8th Floor 
San Francisco 
California 94^1-3834 
Tel 415 576 0200 
Fax 415 576-0300 



VIA EXPRESS MAIL 

Special Programs Examiner Sue Wolski 
Mail StopPCT 
Commissioner for Patents 

P.O. Box 1450 
Alexandria, VA 22313-1450 

Re: International ApplicationNo.PCT/US03/17725 

filed 04 June 2003 

Applicant(s): Metabolex, Inc. a/. 

SMlodll"S:Ung&Treati^^ 
OurRef.: 16325-139PC 

Dear Special Programs Examiner Wolski: 

. th. WTPO Standard ST.25 requirements for the Sequence 
In order to comply with the WIPO ^ this time. 

Listing-please find enclosed the Sequence Listmg and disk at this tim 

disclosure in the international application as filed. 




MEHidniw 
Enclosures 

60072934 vl 



SEQUENCE LISTING 



<110.> Allan, Be^^d 

Grego i re , ^Plnc ine 

La van, Brian 
Moodie, Shonna 
Metabolex, Inc. 

. ' c Treatinq Diabetes and Insulin, 

<120> Methods of Diagnosing & Treating ui 

Resistance 

<130> 016325-013900PC 

<140> WO PCT/US03/17725 
<141> 2003-06-04 

<150> US 60/386,085 
<151> 2002-06-04 

<150> US 60/386,331 
<151> 2002-06-05 

<16Q> 16 

<170> Patentin Ver. 2.1 

<210> 1 

<211> 2312 

<212> DNA 

<213> Homo sapiens 

hun.an connective tissue growth factor (CTGF, cDNA 

<220> 

<22l> CDS 

<222> (146) - . (1195) . 
<223> CTGF 

<400> 1 ^ ^^naaaraac aqcccggcgc gtcccggtcc 60 

tccagtgacg gagccgcccg gccgacagcc l^lll^f l ^Jg^tcgccg ccaccgcgcc 120 
ccacctccga ccaccgccag cgctccaggc cccgcgctcc <^^9 IJ ^eo 
ctccgctccg cccgcagtgc caaccatgac ^llllllt Lccagaact gcagcgggcc 240 
cttcgtggtc ctcctcgccc tctgcagccg g^^g^^^^^^ l^cgtiagcc tcgtgctgga 300 
. gtgccggtgc ccggacgagc cggcgccgcg ^^gcccggcg 99 9 ^ ^ ccgagcgcga 360 
?ggctgcggc tgctgccgcg tctgcgccaa 9cagctgggc 9^9 J ,,,gcaagat 420 
cccctgcgac ccgcacaagg gcctcttctg tgacttcgg tgtaccgcag 480 

cggcgtgtgc accgccaaag atggtgctcc ctgcatcttc 99^99 gg 9 
cigagagtcc ttccagagca gctgcaagta ccagtgcacg tg 99 9 ^^^^^^^^^g eoo 
ctgcatgccc ctgtgcagca tggacgttcg tctgcccagc ^99 ^^^^^^^^^^ 
gagggtcaag ctgcccggga aatgctgcga ggagtgggtg ^9^9 9^9 gcccagaccc 720 
aaccgtggtt gggcctgccc tcgcggctta ^^9actgga 9 gttccaagac 780 

aactatgatt agagccaact gcctggtcca g^^^^^^9ag ^9g g 3 ^agagaa 840 

ctgtgggatg ggcatctcca cccgggttac ^^^^gacaac 9^ ^ ^ acattaagaa 900 
gcagagccgc ctgtgcatgg tcaggccttg -g^^9ctgac ^^99 9 9 ^ctgg 960 

gggcaaaaag tgcatccgta ctccca.aaat ^tccaagcct 9^^^| acggccgatg 1020 

c?gcaccagc atgaagacat accgagctaa ^"^^9tgga 9 9 ^^gg^gaggt 1080 

ctgcaccccc cacagaacca ccaccctgcc g9tggagttc 9 9^^^^ gtcccggaga 1140 
catgaagaag aacatgatgt tcatcaagac ^tgtgcctgc catgaagcca 1200 

caatgacatc tttgaatcgc tgtactacag 9aagatgtac 99 9 ^tcatttttc 1260 

gagagtgaga gacattaact cattagactg gaacttgaac tgattca 



1 



cgtaaaaatg atttcagtag cacaagttat ttaaatctgt ttttctaact SSgggaaaag 20 
a?tcccacc? aattcaaaac attgtgccat 9tcaaacaaa tagtctatct tccccag ^^^.^ 
ctggtttgaa gaat^ag acttgacagt gg^actacat ^agtac||ag ^ ^ ^^^O 

tatattaagg tgtd«ta ggagcagtgg gaggg^acca gcaga«t 9 ^^^^^ ^^^^ 
agatagctct tata^ta atatgcctgc tatttgaagt gtaatoj^a 99 
tagcgtgctc actgacctgc ctgtagcccc agtgacagct aggatgtgca ttctcc g ^^^^ 
atcaagagac tgagtcaagt tgttccttaa 9tcagaacag -gactcagc tctg ^^^^ 
tgattcgaat gacactgttc aggaatcgga atcctgtcga ttagactgg^ ,,?,g,|tgt 1800 
caagtgaatt tcctgtaaca agccagattt tttaaaatcc ^ ttaaagttgt 1860 

gtgtgtgtgt gtgtatatat atatatatat 9tacagttat ctaagttaat tt^^^g^^g^ ^^^^ 
ttgtgccttt ttatttttgt ttttaatgct "gatatttc ^^^9 9 ^^g^ 
aacaccatag gtagaatgta aagcttgtct gatcgttcaa agcatgaaat 9ga 
atggaaattc tctcagatag aatgacagtc cgtcaaaaca gattgtttgc -agggg^gg ^^^^ 
catcagtgtc cttggcaggc tgatttctag g^aggaaatg ^99 9 ^tttttc 2160 

taatgaacaa atggccttta ttaaaaactg agtgactcta ^atagctg 9 .^^O 
acctggaagc atttgtttct actttga a g gt tt c^gsacag ^^^^l^^^^, ,,,, 
agtgtgacca aaagttacat gtttgcaccc ccc^uciyu y ^^^^ 
aaaaaaaaaa aaaaacgaca gcaacggaat tc 



<210> 2 
<211> 349 
<212> PRT 

<213> Homo sapiens 
<220> 



:223> human connective tissue growth factor (CTGF) 



<40p> 2 , « ,r 1 aiA Phe Val Val Leu 

Met Thr Ala Ala Ser Met Gly Pro Val Arg Val Ala Phe va 

1 5 10 ' 

.eu Ala Leu Cys Ser Arg Pro Ala Val Gly Gin Asn Cys Ser Gly Pro 



20 25 



cys Arg Cys Pro Asp Glu Pro Ala Pro Arg Cys Pro Ala Gly Val Ser 
35 

l.eu val Leu Asp Gly Cys Gly Cys Cys Arg Val Cys Ala Lys Gin Leu 

50 55 
Oly Glu Leu cys Thr Glu Arg Asp Pro Cys Asp Pro His Lys Gly Leu 
65 70 75 

Phe cys ASP Phe Gly Ser Pro Ala Asn Arg Lys He Gly Val Cys Thr 

85 

Ma Lys ASP Gly Ala Pro Cys He Phe Gly Gly Thr Val Tyr Arg Ser 

... 100 

Gly Glu ser Phe Gin Ser Ser Cys Lys Tyr Gin Cys- Thr Cys Leu Asp 



115 



Gly Ala val Gly Cys Met Pro Leu Cys Ser, Met Asp Val Arg Leu Pro 
130 

ser pro Asp Cys Pro Phe Pro Arg Arg Val Lys Leu Pro Gly Lys Cys 
145 150 155 

cys Glu Glu Trp val Cys Asp Glu Pro Lys Asp Gin Thr Val Val Gly 



165 170 



Pro Ala Leu Ala Ala Tyr Arg Leu Glu Asp Thr Phe Gly Pro Asp Pro 
180 185 190 




Thr Met lie Arg^PF Asn Cys Leu Val Gin Thr Thr Gi^^rp Ser Ala 

195 200 201 

Cys Ser Lys Thr Cys Gly Met Gly lie Ser Thr Arg Val Thr Asn Asp 

210 215 220 

Asn Ala Ser Cys Arg Leu Glu Lys Gin Ser Arg Leu Cys Met Val Arg 
225 230 235 240 

Pro Cys Glu Ala Asp Leu Glu Glu Asn lie Lys Lys Gly Lys Lys Cys 
245 250 255 

lie Arg Thr Pro Lys lie Ser Lys Pro lie Lys Phe Glu Leu Ser Gly 
260 265 270 

Cys Thr Ser Met Lys Thr Tyr Arg Ala Lys Phe Cys Gly Val Cys Thr 
275 280 285 

Asp Gly Arg Cys Cys Thr Pro His Arg Thr Thr Thr Leu Pro Val Glu 
290 295 300 

Phe Lys Cys Pro Asp Gly Glu Val Met Lys Lys Asn Met Met Phe lie 
305 310 315 320 

Lys Thr Cys Ala Cys His Tyr Asn Cys Pro Gly Asp Asn Asp lie Phe 
325 330 335 

Glu Ser Leu Tyr Tyr Arg Lys Met Tyr Gly .Asp Met Ala 
340 345 



<210> 3 

<211> 2330 

<212> DNA 

<213> Mus musculus 

<220> 

<22 3> mouse connective tissue growth factor (CTGF) 
homo log cDNA 

<220> 

<221> CDS 

<222> (204) (1250) 
<223> CTGF homolog 

<400> 3 

agactcagcc agatccactc cagctccgac cccaggagac cgacctcctc cagacggcag 60 

cagccccagc ccagccgaca accccagacg ccaccgcctg gagcgtccag acaccaacct 120 

ccgcccctgt ccgaatccag gctccagccg cgcctctcgt cgcctctgca ccctgctgtg 180 

catcctccta ccgcgtcccg atcatgctcg cctccgtcgc aggtcccatc agcctcgcct 240 

tggtgctcct cgccctctgc acccggcctg ctacgggcca ggactgcagc gcgcaatgtc 300 

agtgcgcagc cgaagcagcg ccgcactgcc ccgccggcgt gagcctggtg ctggacggct 360 

gcggctgctg ccgcgtctgc gccaagcagc tgggagaact gtgtacggag cgtgacccct 420 

gcgacccaca caagggcctc ttctgcgatt tcggctcccc cgccaaccgc aagattggag 480 

tgtgcactgc caaagatggt gcaccctgtg tcttcggtgg gtcggtgtac cgcagcggtg 540 

agtccttcca aagcagctgc aaataccaat gcacttgcct ggatggggcc gtgggctgcg 600 

tgcccctatg cagcatggac gtgcgcctgc ccagccctga ctgccccttc ccgagaaggg 660 

tcaagctgcc tgggaaatgc tgcgaggagt gggtgtgtga cgagcccaag gaccgcacag 720 



3 



cagttggccc tgccctagct gcctaccgac tggaagacac atttggccca gacccaacta 780 
tgatgcgagc caactgcctg gtccagacca cagagtggag cgcctgttct aagacctgtg 840 
gaatgggcat ctcc^^ga gttaccaatg acaatacctt ctgcag^^g gagaagcaga 900 
gccgcctctg catgj^^g ccctgcgaag ctgacctgga ggaaa^^^ aagaagggca 960 . 
aaaagtgcat ccggacacct aaaatcgcca agcctgtcaa gtttga^fft tctggctgca 1020 
ccagtgtgaa gacatacagg gctaagttct gcggggtgtg cacaigacggc cgctgctgca 1080 
caccgcacag aaccaccact. ctgccagtgg agttcaaatg ccccgatggc gagatcatga 1140 
aaaagaatat gatgttcatc aagacctgtg cctgccatta caactgtcct ggggacaatg 1200 
acatctttga gtccctgtac tacaggaaga tgtacggaga catggcgtaa agccaggaag 1260 
taagggacac gaactcatta gactataact tgaactgagt tgcatctcat tttcttctgt 1320 
aaaaacaatt acagtagcac attaatttaa atctgtgttt ttaactaccg tgggaggaac 1380 
tatcccacca aagtgagaac gttatgtcat ggccatacaa gtagtctgtc aacctcagac 1440 
actggtttcg agacagttta cacttgacag ttgttcatta gcgcacagtg ccagaacgca 1500 
cactgaggtg agtctcctgg aacagtggag atgccaggag aaagaaagac aggtactagc 1560 
tgaggttatt ttaaaagcag cagtgtgcct actttttgga^ gtgtaaccgg ggagggaaat 1620 
tatagcatgc ttgcagacag acctgctcta gcgagagctg agcatgtgtc ctccactaga 1680 
tgaggctgag tccagctgtt ctttaagaac agcagtttca gctctgacca ttctgattcc 174 0 
agtgacactt gtcaggagtc agagccttgt ctgttagact ggacagcttg tggcaagtaa 1800 
gtttgcctgt aacaagccag atttttattg atattgtaaa tattgtggat atatatatat 1860 
atatatatat atttgtacag ttatctaagt taatttaaag tcatttgttt ttgttttaag 1920 
tgcttttggg attttaaact gatagcctca aactccaaac accataggta ggacacgaag 1980 
cttatctgtg attcaaaaca aaggagatac tgcagtggga attgtgacct gagtgactct 2040 
ctgtcagaac aaacaaatgc tgtgcaggtg ataaagctat gtattggaag tcagatttct 2100 
agtaggaaat gtggtcaaat ccctgttggt gaacaaatgg cctttattaa gaaatggctg 2160 
gctcagggta aggtccgatt cctaccagga agtgcttgct gcttctttga ttatgactgg 2220 
tttggggtgg ggggcagttt atttgttgag agtgtgacca aaagttacat gtttgcactt 2280 
tctagttgaa aataaagtat atatatattt ttatatgaaa aaaaaaaaaa 2330 



<210> 4 
<211> 348 
<212> PRT 

<213> Mus musculus 
<220> 

<223> mouse connective tissue growth factor (CTGF) 
homolog 



Val Ala Gly Pro He Ser Leu Ala Leu Val Leu Leu 
5 10 15 

Ala Leu Cys Thr Arg Pro Ala Thr Gly Gin Asp Cys Ser Ala Gin Cys 
20 25 30 

Gin Cys Ala Ala Glu Ala Ala Pro His Cys Pro Ala Gly Val Ser Leu 
35 40 45 

Val Leu Asp Gly Cys Gly Cys Cys Arg Val Cys Ala Lys Gin Leu Gly 
50 55 60 

Glu Leu Cys Thr Glu Arg Asp Pro Cys Asp Pro His Lys Gly Leu Phe 
65 70 75 80 

Cys Asp Phe Gly Ser Pro Ala Asn Arg Lys He Gly Val Cys Thr Ala 
85 90 95 

Lys Asp Gly Ala Pro Cys Val Phe Gly Gly Ser Val Tyr Arg Ser Gly 
100 105 110 



<400> 4 

Met Leu Ala Ser 
1 



4 



Glu Ser Phe Gin Ser Ser Cys Lys Tyr Gin Cys Thr Cys Leu Asp Gly 
115 120 125 

Ala Val Gly Cys iBr Pro Leu Cys Ser Met Asp Val Ai^Hbu Pro Ser 

130 135 140 



Pro Asp Cys Pro Phe Pro Arg Arg Val Lys Leu Pro Gly Lys Cys Cys 
145 150 155 160 

Glu Glu Trp Val Cys Asp Glu Pro Lys Asp Arg Thr Ala Val Gly Pro 
165 170 175 

Ala Leu Ala Ala Tyr Arg Leu Glu Asp Thr Phe Gly Pro Asp Pro Thr 
180 185 190 

Met Met Arg Ala Asn Cys Leu Val Gin Thr Thr Glu Trp Ser Ala Cys 
195 200 205 

Ser Lys Thr Cys Gly Met Gly lie Ser Thr Arg Val Thr Asn Asp Asn 
210 215 . 220 

Thr Phe Cys Arg Leu Glu Lys Gin Ser Arg Leu Cys Met Val Arg Pro 
225 230 235 240 

Cys Glu Ala Asp Leu Glu Glu Asn lie Lys Lys Gly Lys Lys Cys lie 
245 250 255 

Arg Thr Pro Lys lie Ala Lys Pro Val Lys Phe Glu Leu Ser Gly Cys 

260 265 270 

Thr Ser Val Lys Thr Tyr Arg Ala Lys Phe Cys Gly Val Cys Thr Asp 
275 280 285 

Gly Arg Cys Cys Thr Pro His Arg Thr Thr Thr Leu Pro Val Glu Phe 
290 295 300 

Lys Cys Pro Asp Gly Glu lie Met Lys Lys Asn Met Met Phe lie Lys 
305 310 315 320 

Thr Cys Ala Cys His Tyr Asn Cys Pro Gly Asp Asn Asp lie Phe Glu 
325 330 335 



Ser Leu Tyr Tyr Arg Lys Met Tyr Gly Asp Met Ala 
340 345 



<210> 5 

<211> 2345 

<212> DNA 

<213> Rattus norvegicus 
<220> 

<223> rat connective tissue growth factor (CTGF) homolog 
cDNA 



<220> 
<221> CDS 

<222> (225) . . (1268) 
<223> CTGF homolog 



5 



<400> 5 

cacagctctt ctct^^|ga agactcagcc 
cgacctcctc caga^^Bag cagccccagc 
gagcgtccgg acaccaacct ccgccccgag 
tcgcctctgc accccgctgt gcgtcctcct 
cgggtcccgt tagcctcgcc ttggtgctcc 
actgcagcgc gcagtgtcag tgcgcagctg 
gcctggtgct ggacggctgc ggctgctgcc 
gcacggagcg tgatccctgc gacccacaca 
ccaaccgcaa gattggcgtg tgcactgcca 
ccgtgtaccg cagcggcgag tccttccaaa 
atggggccgt gggctgtgtg cccctgtgca 
gccccttccc gagaagggtc aagctgcccg 
agcccaagga ccgcacagtg gttggccctg 
ttggccctga ccdaactatg atgcgagcca 
cctgttctaa gacctgtggg atgggcatct 
gcaggctgga gaagcagagt cgtctctgca 
aaaacattaa gaagggcaaa aagtgcatcc 
ttgagctttc tggctgcacc agtgtgaaga 
cggacggccg ctgctgcaca ccgcacagaa 
ccgatggcga gatcatgaaa aagaacatga 
actgtcccgg ggacaatgac atctttgagt 
tggcgtaaag ccagggagta agggacacga 
acatctcatt ttcttctgta aaaaaacaaa 
tgggttccta actgctgtgg gagaaaacac 
catgcaaata gcctgtcaat ctcagacact 
ttcactagcg cacagtgaca gaacgcacac 
ccaggagaaa gacaggtact agctgaggtc 
ggagtgtgac aggggaggga cattatagct 
ctgggtgtgt gtcctccact cggtgaggct 
agtttcagcg ctgacattct gattccagtg 
tagactggac agcttgtggc aagtgaattt 
tgtaaatatt gtggataaat atatatattt 
ttgtgcctat tgttcttgtt ttaagtgctt 
ccaaacacca tcgataggac ataaagcttg 
gtggaaactg taacctgagt gactgtctgt 
atggatcaga agtcagattt ctagtaggaa 
ctttattaag aaatggcttg ctcagggtaa 
tgcttctttg actatgactg gtttgggagg 
ttacatgttt gcacctttct agttgaaaat 
aaaaa 



agacccactc cagctc^^ac cctaggagac 60 
ccagtggaca acccc^^^^ ccaccacctg 120 

accgagtcca ggctc^JK gcgcccctcg 180 
gccgcgcccc gaccatgctc gcctccgtcg 24 0 

tcctctgcac ccggcctgcc accggccagg 300 
aagcggcgcc gcgctgcccc gccggcgtga 360 
gcgtctgcgc caagcagctg ggagaactgt 420 
agggtctctt ctgcgacttc ggctcccccg 480 
aagatggtgc accctgtgtc ttcggtgggt 540 
gcagttgcaa ataccagtgc acttgcctgg 600 
gcatggacgt gcgcctgccc agccctgact 660 
ggaaatgctg tgaggagtgg gtgtgtgatg 720 
ccctagctgc ctaccgactg gaagacacat 780 
actgcctggt ccagaccaca gagtggagcg 840 
ccacccgggt taccaatgac aataccttct 900 
tggtcaggcc ctgtgaagct gacctagagg 960 
ggacgcctaa aattgccaag cctgtcaagt 1020 
cctaccgggc taagttctgt ggggtgtgca 1080 
ccaccacact gccggtggag ttcaagtgcc 1140 
tgttcatcaa gacctgtgcc tgccattaca 1200 
ccttgtacta caggaagatg tatggagaca 1260 
actcatttag actataactt gaactgagtt 1320 
aaggattaca gtagcacatt aatttaaatc 1380 
cccaccgaag tgagaaccgt gtgtcattgt 1440 
ggtttcgaga cagtttagac ttgacagttg 1500 
taaggtgagc ctcctggaag agtggagatg 1560 
attttaaaag cagcgatatg cctacttttt 1620 
tgcttgcaga cagacctgct ctagcaagag 1680 
gaagccagct attctttcag taagaacagc 1740 
acactggtcg ggagtcagaa ccttgtctat 1800 
gccggtaaca agccagattt ttatggatct 1860 
gtacagttat ctaagttaat ttaaagacgt 1920 
ttggaatttt taaactgata gcctcaaact 1980 
tctgtgattc aaaacaaagg agatactgca 2040 
cagaacatat ggtacgtaga cggtaaagca 2100 
atgtaaaatc actgttggcg aacaaatggc 2160 
ctggtcagat ttccacgagg aagtgtttgc 2220 
cagtttattt gttgagagtg tgaccaaaag 2280 
aaagtatata tatttttata aaaaaaaaaa 2340 

2345 



<210> 6 
<211> 347 
<212> PRT 

<213> Rattus norvegicus 
<220> 

<223> rat connective tissue growth factor (CTGF) homolog 
<400> 6 

Met Leu Ala Ser Val Ala Gly Pro Val Ser Leu Ala Leu Val Leu Leu 
15 10 15 

Leu Cys Thr Arg Pro Ala Thr Gly Gin Asp Cys Ser Ala Gin Cys Gin 
20 25 30 

Cys Ala Ala Glu Ala Ala Pro Arg Cys Pro Ala Gly Val Ser Leu Val 
35 40 45 



6 



Leu Asp Gly Cys Gly Cys Cys Arg Val Cys Ala Lys Gin Leu Gly Glu 
50 _ 55 60 

Leu Cys Thr Glu^Bf Asp Pro Cys Asp Pro His Lys G^^Beu Phe Cys 

65 70 75 80 

Asp Phe Gly Ser Pro Ala Asn Arg Lys lie Gly Val Cys Thr Ala Lys 

85 90 95 

Asp Gly Ala Pro Cys Val Phe Gly Gly Ser Val Tyr Arg Ser Gly Glu 
100 105 110 

Ser Phe Gin Ser Ser Cys Lys Tyr Gin Cys Thr Cys Leu Asp Gly Ala 
115 120 125 

Val Gly Cys Val Pro Leu Cys Ser Met Asp Val Arg Leu Pro Ser Pro 
130 135 140 

Asp Cys Pro Phe Pro Arg Arg Val Lys Leu Pro Gly Lys Cys Cys Glu 
145 150 155 160 

Glu Trp Val Cys Asp Glu Pro Lys Asp Arg Thr Val Val Gly Pro Ala 
165 170 175 

Leu Ala Ala Tyr Arg Leu Glu Asp Thr Phe Gly Pro Asp Pro Thr Met 
180 185 190 

Met Arg Ala Asn Cys Leu Val Gin Thr Thr Glu Trp Ser Ala Cys Ser 
195 200 205 

Lys Thr Cys Gly Met Gly lie Ser Thr Arg Val Thr Asn Asp Asn Thr 
210 215 220 

Phe Cys Arg Leu Glu Lys Gin Ser Arg Leu Cys Met Val Arg Pro Cys 
225 230 235 240 

Glu Ala Asp Leu Glu Glu Asn lie Lys Lys Gly Lys Lys Cys lie Arg 
245 250 255 

Thr Pro Lys lie Ala Lys Pro Val Lys Phe Glu Leu Ser Gly Cys Thr 
260 265 270 

Ser Val Lys Thr Tyr Arg Ala Lys Phe Cys Gly Val Cys Thr Asp Gly 
275 280 285 

Arg Cys Cys Thr Pro His Arg Thr Thr Thr Leu Pro Val Glu Phe Lys 
290 295 300 

Cys Pro Asp Gly Glu lie Met Lys Lys Asn Met Met Phe He Lys Thr 
305 310 315 320 

Cys Ala Cys His Tyr Asn Cys Pro Gly Asp Asn Asp He Phe Glu Ser 
325 330 335 

Leu Tyr Tyr Arg Lys Met Tyr Gly Asp Met Ala 
340 345 
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<210> 7 
<211> 3085 
<212> DNA 
<213> Homo sapi 



<220> 

<223> human transforming growth factor-beta (TGFB) 
inducible early growth response (TIEG) cDNA 

<220> 
<221> CDS 

<222> (320) .". (1729) 
<223> TIEG 



<400> 7 

ctcacgctct ctttccctgc cgcctgcctt 
ggggctgtta cacacacgcg cgctgtccat 
gcaattatat tgttagcgat atttcaagag 
gaaggcatca tgcatttttg aaaaactagt 
tgttttttgg tatttttaca caatctctac 
ttaatggaaa ataggagaaa tggaggaaag 
gagtatgtat tcctggaaca aaactgcaga 
tatgtcaatg agctgcagtt ggaagtctga 
tacaccagta tctgatttgt cagaggaaga 
tacaatccca gcattttgtt tgactccacc 
agtgtcaaat ctgatggcac cagcgccatc 
tgccaaacct cacattgccg cacctttcaa 
caaactcccc aaagctcagg caacaagtgt 
taaccaccag acctgcccaa tgaaagcagc 
tagaagaaga acccacctaa atgttgaggc 
gtcaccaaac agatccaaat gtgagagaaa 
tgctgcactt tatgactttt ctgtgccttc 
agcccctgtg tccccacaac agaagtcagt 
gggagtgcca cctatgccgg tcatctgcca 
tgtgacaaca gtcgttccca gcactcctcc 
tgtgttcatg ggcacacaag tccccaaagg 
tgtgcagagt tcaaagcctc cggtggtgag 
ccctgctcct gggttttccc cttcagcagc 
gataaggagt cacatctgta gccacccagg 
tctgaaggcc cacacgagga cgcacacagg 
ttgtgaaagg aggtttgccc gttctgatga 
tgagaagaaa tttgcgtgcc ccatgtgtga 
caagcatgcc cggcgccatc tatcagccaa 
caagctaaat gacattgctc tacctccaac 
gtgaagagtc agaactaact ttggtctcag 
cactgcaagt ctgtggcccc acaacgtggg 
aaggccccgt ctgggttagg tgactaaaag 
caggtttcat ttcttatcac ataagagaga 
ttgaaggttt cagatgaggt caacacaggt 
gttactttac ttttgctgtt tatacttgag 
actggtttca agaatatgga ggctggaagg 
gtaaaatgag tttgtattat tacaaatatt 
tattcctagt ctttccagtc aacatcgtgg 
atttttacac tattgtgaat atttggaatt 
agaattggaa tcctccttaa tttaattgct 
ttttgttttg aaagtttaac aaatgactgt 
agtttgcctg cagtttcttg tgtagatttg 
ctctaagata cactgcactt tgtttagaaa 
gaagggatat taagaatctt agataacttc 
gtacctttat gttatgagga tataatgtgt 
tattcacagg tggacaaatg ttgtcctgtt 



tcttttttcc tttttttgta ttggcgtctt 60 
tgcagcttac ataaaggcgg gcgcgattat 120 
caatggctcg ttttcttagg atttcaacac 180 
attgagaata ataccttgca acgtaaagaa 240 
tttgaccaaa cgagtctgga cagttttctt 300 
aatggaaatg atttctgaaa ggccaaaaga 360 
gaaaagtgat tttgaagctg. tagaagcact 4 20 
ttttaagaaa tacgttgaaa acagacctgt 4 80 
gaatctgctt ccgggaacac ctgattttca 540 
ttacagtcct tctgactttg aaccctctca 600 
tactgtacac ttcaagtcac tctcagatac 660 
agaggaagaa aagagcccag tatctgcccc 720 
gattcgtcat acagctgatg cccagctatg 780 
cagcatcctc aactatcaga acaattcttt 840 
tgcaagaaag aacataccat gtgccgctgt 900 
cacagtggca gatgttgatg agaaagcaag 960 
ctcagagacg gtcatctgca ggtctcagcc 1020 
gttggtctct ccacctgcag tatctgcagg 1080 
gatggttccc cttcctgcca acaaccctgt 1140 
cagccagcca ccagccgttt gcccccctgt 1200 
cgctgtcatg tttgtggtac cccagcccgt 1260 
cccgaatggc accagactct ctcccattgc 1320 
aaaagtcact cctcagattg attcatcaag 1380 
atgtggcaag acatacttta aaagttccca 1440 
agaaaagcct ttcagctgta gctggaaagg 1500 
actgtccaga cacaggcgaa cccacacggg 1560 
ccggcggttc atgaggagtg accatttgac 1620 
gaagctacca aactggcaga tggaagtgag 1680 
ccctgctccc acacagtgac agaccggaaa 174 0 
cgggagccag tggtgatgta aaaatgcttc 1800 
cttaaagcag aagccccaca gcctggcacg 1860 
ggcttcggcc acaggcaggt cacagaaagg 1920 
tgagaaagct tttattcctt tgaatatttt 1980 
agcacagatt ttgaatctgt gtgcatattt 204 0 
accaactttt caatgtgatt cttctaaagc 2100 
aaataaacat tacggtacag acatggagat 2160 
gtcatctttt tctagagtta tcttctttat 2220 
atgtagtgat taaatatatc tagaactatc 2280 
gaacgactgt atattgctaa gagggcccaa 2340 
ttgaagcata gctacaattt gtttttgcat 2400 
atctaggcat ttcattatgc tttgaacttt 2460 
aaaattgtat accaatgtgt tttctgtaga 2520 
aaaaactgaa gatgaaatat atattgtaaa 2580 
ttgaaaaaga tggcttatgt catcagtaaa 2640 
gctttattga attagaaaat tagtgaccat 2700 
aatttatagg agttttttgg ggatgtggag 2760 
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gtagttgggt agaaaaatta ttagaacatt cacttttgtt aacagtattt ctcttttatt 2820 
ctgttatata gtggatoata tacacagtgg caaaacaaaa gtacattgct taaaatatat 2880 
agtgaaaaat gtc^H^at cttcccattt aacattgttt ttgtat^jtg ggtgtagatt 2940 
tctgacatca aaac^JS^^ ccttggaaaa caaaagtttt aatta^^Ba aatccttgtg 3000 
acttacaatt tgcacaatat ttcttttgtt gtactttata tcttg^ffac aataaagaat 3060 
tccctttggt aaaaaaaaaa aaaaa ' 3085 



<210> 8 

<211> 469 

<212> PRT 

<213> Homo sapiens 



<220> 

<223> human ttansf orming growth factor-beta (TGFB) 
inducible early growth response (TIEG) 

<400> 8 

Met Glu Glu Arg Met Glu Met lie Ser Glu Arg Pro Lys Glu Ser Met 
1 5 . 10 15 

Tyr Ser Trp Asn Lys Thr Ala Glu Lys Ser Asp Phe Glu Ala Val Glu 
20 25 30 

Ala Leu Met Ser Met Ser Cys Ser Trp Lys Ser Asp Phe Lys Lys Tyr 

35 40 45 

Val Glu Asn Arg Pro Val Thr Pro Val Ser Asp Leu Ser Glu Glu Glu 
50 55 60 

Asn Leu Leu Pro Gly Thr Pro Asp Phe His Thr lie Pro Ala Phe Cys 
65 70 75 80 

Leu Thr Pro Pro Tyr Ser Pro Ser Asp Phe Glu Pro Ser Gin Val Ser 
85 90 95 

Asn Leu Met Ala Pro Ala Pro Ser Thr Val His Phe Lys Ser Leu Ser 
100 105 110 

Asp Thir Ala Lys Pro His lie Ala Ala Pro Phe Lys Glu Glu Glu Lys 
115 120 125 

Ser Pro Val Ser Ala Pro Lys Leu Pro Lys Ala Gin Ala Thr Ser Val 

130 135 140 

He Arg His Thr Ala Asp Ala Gin Leu Cys Asn His Gin Thr Cys Pro 
145 150 155 160 

Met* Lys Ala Ala Ser He Leu Asn Tyr Gin Asn Asn Ser Phe Arg Arg 
165 170 175 

Arg Thr His Leu Asn Val Glu Ala Ala Arg Lys Asn He Pro Cys Ala 
180 185 190 

Ala Val Ser Pro Asn Arg Ser Lys Cys Glu Arg Asn Thr Val Ala Asp 
195 200 205 

Val Asp Glu Lys Ala Ser Ala Ala Leu Tyr Asp Phe Ser Val Pro Ser 
210 215 220 
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Ser Glu Thr Val lie Cys Arg Ser Gin Pro Ala Pro Val Ser Pro^Gln 
225 • _ 230 235 240 

Gin Lys Ser Val^^ Val Ser Pro Pro Ala Val Ser A^^ly Gly Val 

245 250 255 

Pro Pro Met Pro Val lie Cys Gin Met Val Pro Leu Pro Ala Asn Asn 
260 265 270 

Pro Val Val Thr Thr Val Val Pro Ser Thr Pro Pro Ser Gin Pro Pro 
275 280 285 

Ala Val Cys Pro Pro Val Val Phe Met Gly Thr Gin Val Pro Lys Gly 
290 295 300 

Ala Val Met Phe Val Val Pro Gin Pro Val Val Gin Ser Ser Lys Pro 
305 310 315 320 

Pro Val Val Ser Pro Asn Gly Thr Arg Leu Ser Pro He Ala Pro Ala 
325 330 335 

Pro Gly Phe Ser Pro Ser Ala Ala Lys Val Thr Pro Gin He Asp Ser 
• 340 345 350 

Ser Arg lie Arg Ser His He Cys Ser His Pro Gly Cys Gly Lys Thr 
355 360 365 

Tyr Phe Lys Ser Ser His Leu Lys Ala His Thr Arg Thr His Thr Gly 
370 375 380 

Glu Lys Pro Phe Ser Cys Ser Trp Lys Gly Cys Glu Arg Arg Phe Ala 
385 390 395 400 

Arg Ser Asp Glu Leu Ser Arg His Arg Arg Thr His Thr Gly Glu Lys 
405 410 415 

Lys Phe Ala Cys Pro Met Cys Asp Arg Arg Phe Met Arg Ser Asp His 
420 425 430 

Leu Thr Lys His Ala Arg Arg His Leu Ser Ala Lys Lys Leu Pro Asn 
435 440 445 

Trp Gin Met Glu Val Ser Lys Leu Asn Asp He Ala Leu Pro Pro Thr 
450 455 460 

Pro Ala Pro Thr Gin 
465 



<210> 9 
<211> 3045 
<212> DNA 

<213> Mus musculus 
<220> 

<223> mouse transforming growth factor-beta (TGFB) 

inducible early growth response (TIEG) homolog 
cDNA 
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<220> 

<221> CDS 

<222> (114) . , (1 

<223> TIEG homo 

<400> 9 

cagacagcgc ggagcgcggt ggcgtcgacg tctagtgtct cagtgctccc gtctgtggct 60 
aactaagcag ccagcagcca ggcagctcgc gacctgcggc caggcagcca accatgctca 120 
acttcggcgc ttctctccag caagcttcgg aggggaaaat ggaactaatt tctgaaaagc 180 
ccagagaggg gatgcatccc tgggacaaag ctgagcagag tgactttgaa gcggtggaag 24 0 < 
cgctcatgtc catgagctgc gactggaagt ctcatttcaa gaaatacctt gaaaacaggc 300 
ctgtcacacc agtgtctgat acctccgagg atgacagctt gcttccaggg acgcctgacc 360 
ttcagacagt cccagcattt tgtttaacgc caccttacag cccctctgac ttcgaaccct 420 
cccaagggtc aaatctgact gcatcagcgc catctactgg ccacttcaaa tctttctccg 480 
atgctgccaa gcdtccaggc gccactcctt tcaaagagga ggaaaagaat cctttagctg 540 
cccctcctct tcctaaggct caagccacca gtgtcatccg tcacacagct gatgcccaac 600 
tgtgcaacca ccagtcctgc cccgtgaaag cagctagcat cctcaactat caggacaatt 660 
ctttccggag aagaacccac ggaaatgttg aggctactcg aaagaacata ccctgtgctg 720 
c«i9tgtcacc aaacagatcc aagcctgagc ccagcacagt gtccgatggt gatgagaagg 780 
cgggcgctgc actatatgac tttgctgtgc cttcctcaga gacagtaatt tgtaggtctc 840 
agccagctcc ttcgtcccca gtgcagaagt cagtactggt gtcttcacct acagtatcca 900 
ctgggggagt gccacccctg cctgtcatct gccagatggt tccccttcct gccaacaact 960 
ctcttgttag cacagttgtc cccagcactc ctcctagcca gccaccagct gtctgctcac 1020 
ctgtgttgtt catgggcact caggtgcctg agggcaccgt cgtgtttgtg gtaccccagc 1080 
ccgttgtgca gagcccaagg cctccagtgg tgagccccag tggcaccaga ctgtctccca 114 0 
ttgcccctgc tcctggattc tctccttcag cagcaagggt cactcctcag attgactcgt 1200 
ccagagtaag aagtcacatc tgtagccacc cagggtgtgg caagacttac tttaaaagtt 1260 
cccatctgaa ggcccacgtg aggacacaca caggggaaaa acctttcagc tgcagctgga 1320 
aaggctgtga aaggaggttt gctcgctccg atgaactgtc cagacaccgg cggacacaca 1380 
caggtgagaa gaagtttgcc tgtcccatgt gtgaccgtcg gtttatgagg agcgaccatt 1440 
taaccaagca tgcccgacgc cacctatcag ccaagaagct gccaaactgg caaatggaag 1500 
ttagcaagtt aaatgacatt gctctgcctc cgacccctgc ttccgcacag tgacggccag 1560 
aagatggaga cgcagaataa actttggtta gagtcaggag ccagtgatgg tgtcaagtgc 1620 
ttctgcaagg ctgtggccct ccaaaagggc ctaaagtaga agccctggcc tgggggaggc 1680 
cccgcctggg tgaaatgaca agaagtgctt cagccacagg caggtcacag aggacagggc 1740 
tcagttctta ccacagagag agaggagaac ccttttattc ctcccttatt ttagtctgga 1800 
aagtttcggc tgaggtgagc gcagcacagg ttttgaatca catacacatt ggggactttg 1860 
tttttgccat ttatacttga gaccagcttt gcagtgtgat tctttcaaag gattggtttc 1920 
aagaatatag aggctggaaa ttacggtaca gaaatggagc . tagaaaatga gtttgtgtta 1980 
cacagagatg tcatcttctc ctagagttat cttgtttctt attcctagtc tttccagtca 2040 
aatccgtgga tgtagctaag tatatctaaa actcattttt ccactattgt tggtatttga 2100 
agttgaacag ctgtacattg ttgtggggga gccaaaggat tggaaccctc attaatttaa 2160 
ttgcttggaa atgcagctaa aattcttctt tggcattttg ttttgaaagt ttaggcattt 2220 
tactctactt tagattttag tttgcttgca gttttttgtg tagatttgaa aattgtatac 2280 
caatgtgttt tctgtaggct taaaatacac tgcactttgt ttagaaaaaa atctggagat 2340 
gaaaatatgt attataaaga agagatgtca agaatttgag ataactcctt gagaaagttg 2400 
gctttatgtc atcagcaaag gacacttaac gtcaagcata cactgtggtt tttttgtttt 2460 
tttgtttttt tttttcaaat tagaaagttt aatgaccgtt acagatggac agtgtctttt 2520 
tatttatagg agtttttcag gatgtcagag tagataggta ggaaaattgt tattagaaca 2580 
ttcgcttcta ccttgaaaag gatgttaatg tggtcatgtt cttagcacca cagtgtctgg 2640 
gcatctggga aactccgaga cttttttaaa gtgtcatgat gtgatcacac ctgcagtttg 2700 
gggcatcgiaa tccagggcct tgcatgtctt ctgtaagagc tctcatcgct gacctgtatc 2760 
ccccgcaaga gcaatgactt ttgctaacag tatttctttt ctgttgtaaa gtggacagat 2820 
gatacacttg gtcgcaaagg taaattattc aaaatccaca gtgaaaacct caccacactt 2880 
tcccatttaa actatttcca tatctcagag gtttctgaca tgcaaacttg aacccttgaa 2940 
agaagagttt tcttaaaaat tataaaaaat cacgagttac aatttgcaca atattttttg 3000 
ttgaacttta taccttgttt acaataaaga cttttctttg gtata 3045 



<210> 10 
<211> 479 
<212> PRT 
<213> Mus muscu 



<220> 

<223> mouse transforming growth factor-beta (TGFB) 

inducible early growth response (TIEG) homolog 

<:400> 10 

Met Leu Asn Phe Gly Ala Ser Leu Gin Gin Ala Ser Glu Gly Lys Met 
15 10 15 

Glu Leu lie Ser Glu Lys Pro Arg Glu Gly Met His Pro Trp Asp Lys 
20 25 30 

Ala Glu Gin Ser Asp Phe Glu Ala Val Glu Ala Leu Met Ser Met Ser 
35 40 45 

Cys Asp Trp Lys Ser- His Phe Lys Lys Tyr Leu Glu Asn Arg Pro Val 
50 55 60 

Thr Pro Val Ser Asp Thr Ser Glu Asp Asp Ser Leu Leu Pro Gly Thr 
65 70 75 80 

Pro Asp Leu Gin Thr Val Pro Ala Phe Cys Leu Thr Pro Pro Tyr Ser 
85 90 95 

Pro Ser Asp Phe Glu Pro Ser Gin Gly Ser Asn Leu Thr Ala Ser Ala 
100 105 110 

Pro Ser Thr Gly His Phe Lys Ser Phe Ser Asp Ala Ala Lys Pro Pro 
115 120 125 

Gly Ala Thr Pro Phe Lys Glu Glu Glu Lys Asn Pro Leu Ala Ala Pro 
130 135 140 

Pro Leu Pro Lys Ala Gin Ala Thr Ser Val lie Arg His Thr Ala Asp 
145 150 155 160 

Ala Gin Leu Cys Asn His Gin Ser Cys Pro Val Lys Ala Ala Ser lie 
165 170 175 

Leu Asn Tyr Gin Asp Asn Ser Phe Arg Arg Arg Thr His Gly Asn Val 
180 185 190 

Glu Ala Thr Arg Lys Asn lie Pro Cys Ala Ala Val Ser Pro Asn Arg 
195 200 205 

Ser Lys Pro Glu Pro Ser Thr Val Ser Asp Gly Asp Glu Lys Ala Gly 
210 215 220 

Ala Ala Leu Tyr Asp Phe Ala Val Pro Ser Ser Glu Thr Val He Cys 
225 230 235 . 240 

Arg Ser Gin Pro Ala Pro Ser Ser Pro Val Gin Lys Ser Val Leu Val 

245 250 255 

Ser Ser Pro Thr Val Ser Thr Gly Gly Val Pro Pro Leu Pro Val lie 
260 265 270 
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Cys Gin Met Val Pro Leu ^ro Ala Asn Asn Ser Leu Val Ser Thr Val 
275 280 285 



m 



Val Pro Ser Thr^ib Pro Ser Gin Pro Pro Ala Val 
290 295 300 



m 



er Pro Val 



Leu Phe Met Gly Thr Gin Val Pro Glu Gly Thr Val Val Phe Val Val 
305 310 315 320 

Pro Gin Pro Val Val Gin Ser Pro Arg Pro Pro Val Val Ser Pro Ser 
325 ■ 330 335 

Gly Thr Arg Leu Ser Pro lie Ala Pro Ala Pro Gly Phe Ser Pro Ser 
340 345 350 

Ala Ala Arg Val Thr Pro Gin lie Asp Ser Ser Arg Val Arg Ser His 
355 360 365 

He Cys Ser His Pro Gly Cys Gly Lys Thr Tyr Phe Lys Ser Ser His 
370 375 . 380 

Leu Lys Ala His Val Arg Thr His Thr Gly Glu Lys Pro Phe Ser Cys 
385 390 395 400 

Ser Trp Lys Gly Cys Glu Arg Arg Phe Ala Arg Ser Asp Glu Leu Ser 
405 410 415 

Arg His Arg Arg Thr His Thr Gly Glu Lys Lys Phe Ala Cys Pro Met 
420 425 430 

Cys Asp Arg Arg Phe Met Arg Ser Asp His Leu Thr Lys His Ala Arg 
435 440. 445 

Arg His Leu Ser Ala Lys Lys Leu Pro Asn Trp Gin Met Glu Val Ser 
450 455 460 



Lys Leu Asn Asp He Ala Leu Pro Pro Thr Pro Ala Ser Ala Gin 
465 470 475 



<210> 11 
<211> 3115 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<223> rat transforming growth factor-beta (TGFB) 

inducible early growth response (TIEG) homolog 
cDNA 

<220> 
<221> CDS 

<222> (316) . . (1758) 
<223> TIEG homolog 

<400> 11 

ggagggaaca cggtgcctcg ggttgtgtac acgctccact gacagagctt cttgcagccg 60 
ggcagccgct gatcacgcgt ggccccgcca gcccattggc tgaggcctca cacacctttg 120 
. ccgctgattg gtcgatacca cgccccgcct tccaccccgc ccgcggcgcg gggcaggcag 180 
agcggctacc tgcatgggga gggggggcag acagcgcgga gcgcggtggc gtcgacgtct 240 
agtgtctcag tgctccgtct gtggctaggc agccagcagc caagcagctc gcgacctacg 300 
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gccaggcagc caaccatgct taacttcggc gcttctctcc agcaagcttc ggaggggaaa 360 
atggaactaa tttctmaaa gtccaaagag ggagcacatc cctgggacaa agctgaacag 420 
agtgactttg aagc^^^^a agcgctcatg tccatgagct gcgac^^« gtctcacttc 480 

aagaaatacc ttgaalircag gcctgtcaca ccagtgtctg ataccC^ga ggaagacagt 540 
ttgcttccag ggacgcctga ccttcagacc gtcccagcat tctgtttgac tccaccttac 600 
agcccctctg acttcgaacc ctcgcagggg tcaaatctga cggcaccagc gccacctact 660 
ggtcacttca gatcgctctc cgatgctgcc aagcctccca gcatcgctcc ttttaaagag 720 
gaggaaaaga gcccattagc tgcccctccc ctccctaagg ctcaagccac cagtgtcatc 780 
cgtcacacag ctgatgccca actctgtaac caccagtcct gccctgtgaa agcagctagc 840 
atcctcaact accaggacaa ttctttccgg agaagaaccc acataaatgt cgaggctact 900 
cgaaagaaca taccctgtgc tgctgtgtca ccaaacagac ccaagcctga gcccagcaca 960 
gcggcgaatg gcgctgagaa ggccggcact gcgccgtatg actttgctgt gccttcctca 1020 
gagacagtca tctgtaggtc ttctcagcca gctcccacgt ccccagtaca gaagtcagta 1080 
ctgatgtctt cacctacagt gtccactggg ggagtgccac ccttgcctgt catctgccag 1140 
atggttcccc ttdcagcaaa caactctctc gttaccacag ttgtccccag cagtccgccc 1200 
agccagccac ccgctgtctg ctcacccgtg ttgttcatgg gcactcaggt gcccaagggc 1260 
acggtcatgt ttgtggtacc ccagcccgtt gtacagagcc caaagcctcc agtggtgagc 1320 
cccaatggca ccagactgtc tcccattgcc cctgcgcctg gattctctcc ttcagcagca 1380 
agggttactc cccagattga ttcgtccaga gtaagaagtc acatctgtag ccacccagga 1440 
tgtggcaaga cttactttaa aagttcccat ctgaaggccc acgtgagaac acacacaggg 1500 
gaaaaacctt tcagctgcag ctggaaaggc tgtgaaagga ggtttgctcg ttccgatgaa 1560 
ctgtccagac accggcggac acacacaggt gagaagaaat ttgcctgtcc catgtgtgac 1620 
cgtcggttca tgagaagtga ccatttaacc aaacatgccc ggcgacacct atcagccaag 1680 
aagctgccaa actggcaaat ggaagtgagc aagttaaatg acatcgctct gcctccggcc 1740 
acagcgtcgg cacagtgacg gccagaagat ggagactcag aataaacttt ggtcagagtc 1800 
gagccggtga ccgtgtcaaa tgcttctgca agtctgcggc cctccaacag ggcctaaagc 1860 
agaagcccct ggcctggggg gaggcccagc ctgggttaga tgacgagaag tgcttcagcc 1920 
acaggcagtt cacagaaggg cggggttcat ttcttaccac gtagagagat gagaacactc 1980 
gtgtcccttg agtagttttg gaaggtttcg gatgagatga gcacagcaca ggttttgaat 2040 
cacatacaca tcaggtactt tgtttttgcc atttatactt gagaccagct ttgcagtgtg 2100 
attcttttaa agcgttggtt tcgagaatat agaggctgaa attacggtac agaaatgcag 2160 
ctagaaagtg agtttgtgtc atcttctcct agagttatct tgcttattat tcctagtctt 2220 
tacagtcaac. tccgtggacg tggctaagta tagaaaacta tcatttcttc cactgttgtt 2280 
ggtatttgag gttgaacagc tgtacattgc taaggggagc ccaaggattg gaactctcat 2340 
taatttaatt gcttggaagt gcagctaaag ttcttttttt tggccttttg ttttgaaagt 2400 
ttaggcattt tactctacct tagatattag tttgcttgca gttttttgtg tagatttgaa 2460 
aactgtacac caatgtgttt cctatagact taagatacgc tgcgctttgt ttagaaaaca 2520 
atctggcagt gaaaatatgt gtcgtaaaga agagatgtca agaatttgag atgactcctt 2580 
gagaaagatg gcttatgtca tcagcaaagg acacttacgt cagacgtaca ctgtttctat 2640 
caaatcagga agttagtgac catcacagat .ggacaggtgt ccttttattt ataggagttt 2700 
gtcaggatgt ctgagtagat aggtaggaaa ttgttactag aacattcact tccgacttga 2760 
caaggatatt catggtgcag atacgttctt cctgccgctg cagtgtctgg gtgtccggga 2820 
aactccgaga ctttaaagtg tcataatgtg gtcacacctg cagttctggg gatccagagc 2880 
cttgcttgtg ttctgtaagc tctgtcaccg agttgtatcc ccacctgaaa gcagtgactt 2940 
ttgctaacag tatttctgtt ctgttgtaaa gtggatagaa gacacacttg gtagtaaagg 3000 
taaattattc aaagtccaca gtgaaaaact caccatacct tcccatttaa accgtttctg 3060 
tatctcatag gtttctgaca tgcaaacttg aacccttgaa aaaaagagtt ttctt 3115 



<210> 12 
<211> 480 
<212> PRT 

<213> Rattus norvegicus 
<220> 

<223> rat transforming growth factor-beta (TGFB) 

inducible early' growth response (TIEG) homolog 

<400> 12 

Met Leu Asn Phe Gly Ala Ser Leu Gin Gin Ala Ser Glu Gly Lys Met 
15 10 15 
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Glu Leu He Ser Glu Lys Ser Lys Glu Gly Ala His Pro Trp Asp Lys 
20^^ 25 

Ala Glu Gin Ser Phe Glu Ala Val Glu Ala Leu M^^^r Met Ser 

35 40 45 

Cys Asp Trp Lys Ser His Phe Lys Lys Tyr Leu Glu Asn Arg Pro Val 
50 55 60 

Thr Pro Val Ser Asp Thr Ser Glu Glu Asp Ser Leu Leu Pro Gly Thr 
65 70 75 80 

Pro Asp Leu Gin Thr Val Pro Ala Phe Cys Leu Thr Pro Pro Tyr Ser 
85 90 95 

Pro Ser Asp Phe Glu Pro Ser Gin Gly Ser Asn Leu Thr. Ala Pro Ala 
100 105 110 

Pro Pro Thr Gly His Phe Arg Ser Leu Ser Asp Ala Ala Lys Pro Pro 
115 120 125 

Ser He Ala Pro Phe Lys Glu Glu Glu Lys Ser Pro Leu Ala Ala Pro 
130 135 140 

Pro Leu Pro Lys Ala Gin Ala Thr Ser Val He Arg His Thr Ala Asp 
145 150 155 160 

Ala Gin Leu Cys Asn His Gin Ser Cys Pro Val Lys Ala Ala Ser He 
165 170 175 

Leu Asn Tyr Gin Asp Asn Ser Phe Arg Arg Arg Thr His He Asn Val 
180 185 190 

Glu Ala Thr Arg Lys Asn He Pro Cys Ala Ala Val Ser Pro Asn Arg 
195 200 205 

Pro Lys Pro Glu Pro Ser Thr Ala Ala Asn Gly Ala Glu Lys Ala Gly 
210 215 220 

Thr Ala Pro Tyr Asp Phe Ala Val Pro Ser Ser Glu Thr Val He Cys 
225 230 235 240 

Arg Ser Ser Gin Pro Ala Pro Thr Ser Pro Val Gin Lys Ser Val Leu 
245 250 255 



Met Ser Ser Pro Thr Val Ser Thr 
260 

He Cys Gin Met Val Pro Leu Pro 
275 280 

Val Val Pro Ser Ser Pro Pro Ser 
290 295 

Val Leu Phe Met Gly Thr Gin Val 
305 3ia 

Val Pro Gin Pro Val Val Gin Ser 
325 



Gly Gly Val Pro Pro Leu Pro Val 
265 270 

Ala Asn Asn Ser Leu Val Thr Thr 
285 

Gin Pro Pro Ala Val Cys Ser Pro 
300 

Pro Lys Gly Thr Val Met Phe Val 

315 320 

Pro Lys Pro Pro Val Val Ser Pro 
330 335 
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Asn Gly Thr Arg Leu Ser Pro lie Ala Pro Ala Pro Gly Phe Ser Pro 
340,^^ 345 _350 



Ser Ala Ala Arg^n Thr Pro Gin lie Asp Ser Ser A^^^al Arg Ser 

355 360 365 




His lie Cys Ser His Pro Gly Cys Gly Lys Thr Tyr Phe Lys Ser Ser 
370 375 380 

His Leu Lys Ala His Val Arg Thr His Thr Gly Glu Lys Pro Phe Ser 
385 390 395 400 

Cys Ser Trp Lys Gly Cys Glu Arg Arg Phe Ala Arg Ser Asp Glu Leu 
405 410 415 

Ser Arg His Arg Arg Thr His Thr Gly Glu Lys Lys Phe Ala Cys Pro 
420 425 430 

. Met Cys Asp Arg Arg Phe Met Arg Ser Asp His Leu Thr Lys His Ala 
435 440 445 

Arg Arg His Leu Ser Ala Lys Lys Leu Pro Asn Trp Gin Met Glu Val 
450 455 460 



Ser Lys Leu Asn Asp lie Ala Leu Pro Pro Ala Thr Ala Ser Ala Gin 
465 470 475 480 



<210> 13 
<211> 2872 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> human transforming growth factor-beta (TGFB) 
inducible early growth response (TIEG) splice 
variant cDNA 

<220> 

<221> CDS 

<222> (87) . . (1529) 

<223> TIEG splice variant 

<400> 13 ■ 

gaattcggca cgagcgcccg tctgtggcca agcagccagc agcctagcag ccagtcagct 60 
tgccgccggc ggccaagcag ccaaccatgc tcaacttcgg tgcctctctc cagcagactg 120 
cggaggaaag aatggaaatg atttctgaaa ggccaaaaga gagtatgtat tcctggaaca 180 
aaactgcaga gaaaagtgat tttgaagctg tagaagcact tatgtcaatg agctgcagtt 240 
ggaagtctga ttttaagaaa tacgttgaaa acagacctgt tacaccagta tctgatttgt 300 
cagaggaaga gaatctgctt ccgggaacac ctgattttca tacaatccca gcattttgtt 360 
tgactccacc ttacagtcct tctgactttg aaccctctca agtgtcaaat ctgatggcac 420 
cagcgccatc tactgtacac ttcaagtcac tctcagatac tgccaaacct cacattgccg 4 80 
cacctttcaa agaggaagaa aagagcccag tatctgcccc caaactcccc aaagctcagg 540 
caacaagtgt gattcgtcat acagctgatg cccagctatg taaccaccag acctgcccaa 600 
tgaaagcagc cagcatcctc aactatcaga acaattcttt tagaagaaga acccacctaa 660 
atgttgaggc tgcaagaaag aacataccat gtgccgctgt gtcaccaaac agatccaaat 720 
gtgagagaaa cacagtggca gatgttgatg agaaagcaag tgctgcactt tatgactttt 780 
ctgtgccttc ctcagagacg gtcatctgca ggtctcagcc agcccctgtg tccccacaac 840 
agaagtcagt gttggtctct ccacctgcag tatctgcagg gggagtgcca cctatgccgg 900 
tcatctgcca gatggttccc cttcctgcca acaaccctgt tgtgacaaca gtcgttccca 960 
gcactcctcc cagccagcca ccagccgttt gcccccctgt tgtgttcatg ggcacacaag 1020 
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tccccaaagg 

cggtggtgag 

cttcagcagc 

gccacccagg 

cgcacacagg 

gttctgatga 

ccatgtgtga 

tatcagccaa 

tacctccaac 

ttggtctcag 

acaacgtggg 

tgactaaaag 

ataagagaga 

caacacaggt 

tatacttgag 

agctggaagg 

tacaaatatt 

aacatcgtgg 

atttggaatt 

tttaattgct 

aaatgactgt 

tgtagatttg 

tgtttagaaa 

agataacttc 

tataatgtgt 

ttcgtcctgt 

attagaacat 

atacacagtg 

tcttcccatt 

cccttggaaa 

tttcttttgt 



cgctgtcatg 
cccgaatqgc 
aaaa^^Kct 
atgtlBaag 

agaaaagcct 
actgtccaga 
ccggcggttc 
gaagctacca 
ccctgctccc 
cgggagccag 
cttaaagcag 
ggcttcggcc 
tgagaaagct 
agcacagatt 
accaactttt 
aaataaacat 
gtcatctttt 
atgtagtgat 
gaacgactgt 
ttgaagcata 
atctaggcat 
aaaattgtat 
aaaaactgaa 
ttgaaaaaga 
gctttattga 
taatttatag 
tcacttttgt 
gcaaaacaaa 
taacattgtt 
acaaaagttt 
tgtactttat 



tttgtggtac 

accagactct 

cctcagattg 

acatacttta 

ttcagctgta 

cacaggcgaa 

atgaggagtg 

aactggcaga 

acacagtgac 

tggtgatgta 

aagccccaca 

acaggcaggt 

tttattcctt 

ttgaatctgt 

caatgtgatt 

tacggtacag 

tctagagtta 

taaatatatc 

atattgctaa 

gctacaattt 

ttcattatgc 

accaatgtgt 

gatgaaatat 

tggcttatgt 

attagaaaat 

gagttttttg 

taacagtatt 

agtacattgc 

tttgtatatt 

taattaaaaa 

atcttgttta 



cccagcccgt 

ctcccattgc 

attcatcaag 

aaagttccca 

gctggaaagg 

cccacacggg 

accatttgac 

tggaagtgag 

agaccggaaa 

aaaatgcttc 

gcctggcacg 

cacagaaagg 

tgaatatttt 

gtgcatattt 

cttctaaagc 

acatggagat 

tcttctttat 

tagaactatc 

gagggcccaa 

gtttttgcat 

tttgaacttt 

tttctgtaga 

atattgtaaa 

catcagtaaa 

tagtgaccat 

gggatgtgga 

tctcttttat 

ttaaaatata 

gggtgtagat 

aaatccttgt 

caataaagaa 



tgtgcagagt 
ccctgctcct 
gataaj 
tctgal 
ttgtgaaagg 
tgagaagaaa 
caagcatgcc 
caagctaaat 
gtgaagagtc 
cactgcaagt 
aaggccccgc 
caggtttcat 
ttgaaggttt 
gttactttac 
actggtttca 
gtaaaatgag 
tattcctagt 
atttttacac 
agaattggaa 
ttttgttttg 
agtttgcctg 
ctctaagata 
gaagggatat 
gtacctttat 
tattcacagg 
ggtagttggg 
tctgttatat 
tagtgaaaaa 
ttctgacatc 
gacttacaat 
ttccctttgg 



tcaaagcctc 
gggttttccc 
cacatctgta 
cacacgagga 
aggtttgccc 
tttgcgtgcc 
cggcgccatc 
gacattgctc 
agaactaact 
ctgtggcccc 
ctgggttagg 
ttcttatcac 
cagatgaggt 
ttttgctgtt 
agaatatgga 
tttgtattat 
ctttccagtc 
tattgtgaat 
tcctccttaa 
aaagtttaac 
cagtttcttg 
cactgcactt 
taagaatctt 
gttatgagga 
tggacaaatg 
tagaaaaatt 
agtggatgat 
tgtcactata 
aaaacttgga 
ttgcacaata 
ca 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2872 



<210> 14 
<211> 480 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> human transforming growth factor-beta (TGFB) 
inducible early growth response (TIEG) splice 
variant 

<400> 14 

Met Leu Asn Phe Gly Ala Ser Leu Gin Gin Thr Ala Glu Glu Arg Met 
1 5 10 15 

Glu Met lie Ser Glu Arg Pro. Lys Glu Ser Met Tyr Ser Trp Asn Lys 
20 25 30 

Thr Ala Glu Lys Ser Asp Phe Glu Ala Val Glu Ala Leu Met Ser Met 
35 40 45 

Ser Cys Ser Trp Lys Ser Asp Phe Lys Lys Tyr Val Glu Asn Arg Pro 
50 55 60 

Val Thr Pro Val Ser Asp Leu Ser Glu Glu Glu Asn Leu Leu Pro Gly 
65 70 75 80 



Thr Pro Asp Phe His Thr lie Pro Ala Phe Cys Leu Thr Pro Pro Tyr 
85 90 95 
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Ser Pro Ser Asp Phe Glu Pro Ser Gin Val Ser Asn Leu Met Ala Pro 
100^^ 105 110 

Ala Pro Ser Thr^BF His Phe Lys Ser Leu Ser Asp Ti^Hla Lys Pro 

115 120 12'5 

His lie Ala Ala Pro Phe Lys Glu Glu Glu Lys Ser Pro Val Ser Ala 
130 135 140 

Pro Lys Leu Pro Lys Ala Gin Ala Thr Ser Val lie Arg His Thr Ala 
145 150 155 160 

Asp Ala Gin Leu Cys Asn His Gin Thr Cys Pro Met Lys Ala Ala Ser 
165 170 175 

lie Leu Asn Tyr Gin Asn Asn Ser Phe Arg Arg Arg Thr His Leu Asn 
180 185 190 

Val Glu Ala Ala Arg Lys Asn lie Pro Cys Ala Ala Val Ser Pro Asn 
195 200 205 

Arg Ser Lys Cys Glu Arg. Asn Thr Val Ala Asp Val Asp Glu Lys Ala 
210 215 220 

Ser Ala Ala Leu Tyr Asp Phe Ser Val Pro Ser Ser Glu Thr Val lie 
225 230 235 240 

Cys Arg Ser Gin Pro Ala Pro Val Ser Pro Gin Gin Lys Ser Val Leu 

245 250 255 

Val Ser Pro Pro Ala Val Ser Ala Gly Gly Val Pro Pro Met Pro Val 
260 265 270 

lie Cys Gin Met Val Pro Leu Pro Ala Asn Asn Pro Val Val Thr Thr 
275 280 ^ 285 

Val Val Pro Ser Thr Pro Pro Ser Gin Pro Pro Ala Val Cys Pro Pro 
290. 295 300 

Val Val Phe Met Gly Thr Gin Val Pro Lys Gly Ala Val Met Phe Val 
305 310 315 320 

Val Pro Gin Pro Val Val Gin Ser Ser Lys Pro Pro Val Val Ser Pro 
325 330 335 

Asn Gly Thr Arg Leu Ser Pro lie Ala Pro Ala Pro Gly Phe Ser Pro 
340 345 350 

Ser Ala Ala Lys Val Thr Pro Gin lie Asp Ser Ser Arg lie Arg Ser 
355 360 365 

His He Cys Ser His Pro Gly Cys Gly Lys Thr Tyr Phe Lys Ser Ser 
370 375 380 

His Leu Lys Ala His Thr Arg Thr His Thr Gly Glu Lys Pro Phe Ser 
385 390 395 400 

Cys Ser Trp Lys Gly Cys Glu Arg Arg Phe Ala Arg Ser Asp Glu Leu 
405 410 415 
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Ser Arg His Arg Arg Thr His Thr Gly Glu Lys Lys Phe Ala Cys Pro 
420^^ 425 430 



Met Cys Asp Arg 
435 



Phe Met Arg Ser Asp His Leu l^HPys His Ala 

440 445 



Arg Arg His Leu Ser Ala Lys Lys Leu Pro Asn Trp Gin Met Glu Val 
450 455 460 



Ser Lys Leu Asn Asp lie Ala Leu Pro Pro Thr Pro Ala Pro Thr Gin 
465 470 475 480 



<210> 15 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :hexahistidine 
(His) affinity tag 

<400> 15 

His His His His His His 
1 5 



<210> 16 
<211> 200 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :poly~Gly 
flexible linker 

<220> 

<221> m6d_RES 
<222> (6) . . (200) 

<223> Gly residues from position 6 to 200 may be present 
or absent 

<400> 16 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
15 10 15 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
20 25 30 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
35 40 45 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
50 55 60 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
65 , 70 75 80 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
85 90 95 
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Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
100 105 _110 



Gly Gly Gly Gly^^ Gly Gly Gly Gly Gly Gly Gly GJ^rLy Gly Gly 
115 120 125 



Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
130 135 140 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
145 150 155 160 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
165 ^ 170 175 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
180 185 190 



Gly Gly Gly Gly Gly Gly Gly Gly 
195 200 
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2 (57) Abstract: Methods of Diagnosing «& Treating Diabetes and Insulin Resistance ABSTRACT OF THE DISCLOS 

invention provides compositions and methods for diagnosing and treating diabetes and insulin resistance. In particular, the inven- 
Q tion provides methods of identifying modulators of the polynucleotides or polypeptides of the invention and using those modulators 
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invention in a patient. 
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CROSS REFERENCE TO RELATED APPLICATIONS 
[01] The present application claims priority to U.S. Provisional Patent 
5 Application No. 60/386,085, filed June 4, 2002 and U.S. Provisional Patent Application No. 
60/386,331, June 5, 2002, each of which are incorporated by reference. 

BACKGROUND OF THE INVENTION 
[02] Diabetes mellitus can be divided into two clinical syndromes. Type 1 

10 and Type 2 diabetes mellitus. Type 1, or insulin-dependent diabetes mellitus (IDDM), is a 

chronic autoimmune disease characterized by the extensive loss of beta cells in the pancreatic 
Islets of Langerhans, which produce insulin. As these cells are progressively destroyed, the 
amount of secreted insulin decreases, eventually leading to hyperglycemia (abnormally high 
level of glucose in the blood) when the amount of secreted insulin drops below the level 

1 5 required for euglycemia (normal blood glucose level). Althougji the exact trigger for this 
immune response is not known, patients with IDDM have high levels of antibodies against 
proteins expressed in pancreatic beta cells. However, not all patients with high levels of 
these antibodies develop IDDM. 

[03] Type 2 diabetes (also referred to as non-insulin dependent diabetes 

20 mellitus (NIDDM)) develops when muscle, fat and liver cells fail to respond normally to 

insulin. This failure to respond (called insulin resistance) may be due to reduced numbers of 
insulin receptors on these cells, or a dysfunction of signaling pathways within the cells, or 
both. The beta cells initially compensate for this insulin resistance by increasing instdin 
ou^ut. Over time, these cells become unable to produce enough insulin to maintain normal 

25 glucose levels, indicating progression to Type 2 diabetes. 

[04] Type 2 diabetes is brought on by a combination of genetic and 
acquired risk factors - including a high-fat diet, lack of exercise, and aging. Worldwide, 
Type 2 diabetes has become an epidCTiic, driven by increases in obesity and a sedentary 
lifestyle, widespread adoption of western dietary habits, and the general aging of the 

30 population in many countries. In 1985, an estimated 30 million people worldwide had 

diabetes by 2000, this figure had increased 5-fold, to an estimated 154 million people. The 
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number of people withfJ^etes is expected to double between i^^d 2025, to about 300 
million. 

[05] Type 2 diabetes is a complex disease characterized by defects in 
glucose and lipid metabolism. Typically there are perturbations in many metabolic 
parameters iacluding increases in fasting plasma glucose levels, free fatty acid levels and. 
triglyceride levels, as well as a decrease in the ratio of HDL/LDL. As discussed above, one 
of the principal underlying causes of diabetes is thought to be an increase in insulin resistance 
in peripheral tissues, principally muscle and fat. 

[06] Therapies aimed at reducing peripheral insulin resistance are available. 
The most relevant to this invention are drags of the thiazolidinedione (TZD) class namely 
troglitazone, piogUtazone, and rosigUtazone. La the US these have been marketed imder the 
names Rezulin™, Avandia™ and Actos™, respectively. The principal effect of these drags is 
to improve glucose homeostasis. Notably in diabetics treated with TZDs there are increases 
in peripheral glucose disposal rates indicative of increased insulin sensitivity m both muscle 
and fat. 

[07] The molecular target of TZDs is a member of the PP AR family of 
Ugand-activated transcription factors called PPAR gamma. This transcription factor is highly 
expressed in adipose tissue with much lower levels being observed in muscle. Binding of 
TZDs to PPAR gamma in target cells and tissues such as fat and muscle brings about a 
change in gene expression. The link between TZD-altered gene expression in fat and muscle 
and increased insulin sensitivity is unknown. The present invention addresses this and other 
problems. 

BRffiF SUMMARY OF THE INVENTION 
[08] The present invention provides methods for identifying an agent for 
treating a diabetic or pre-diabetic individual. In some embodiments, the methods comprise 
the steps of: (i) contacting an agent to a mixture comprising a polypeptide encoded by a 
polynucleotide that hybridizes xmder stringent conditions to a nucleic acid encoding SEQ ID 
NO:2, SEQ ID N0:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12 or SEQ 
ID NO: 14; and (ii) selecting an agent that modulates the expression or activity of the 
polypeptide or that binds to the polypeptide, thereby identifying an agent for treatiag a 
diabetic or pre-diabetic individual. In some embodiments, the methods further comprise 
selecting an agent that modulates insulin sensitivity. 
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[09] uRome embodiments, step (ii) comprises^Prcting an agent that 
modulates expression of the polypeptide. In some embodimaits, step (ii) comprises selecting 
an agent that modulates the activity of the polypeptide. In some embodiments, step (ii) 
comprises selecting an agent that specifically binds to the polypeptide. In some 
5 embodiments, the polypeptide is expressed in a cell and the cell is contacted with the agent. 

[10] The present invention also provides methods of treating a diabetic or 
pre-diabetic animal. In some embodiments, the methods comprise adndinistering to the 
animal a therapeutically effective amount of an agent identified as described above. La some 
embodiments, tiie agent is an antibody. In some embodiments, the antibody is a monoclonal 
10 antibody. Li some embodiments, the animal is a human. 

[11] The present invention also provides methods of introducing an 
expression cassette into a cell. In some emodiments, the methods comprise introducing into 
the cell an expression cassette comprising a promoter operably Imked to a polynucleotide 
encoding a polypeptide, wherein the polynucleotide hybridizes under stringent conditions to a 
15 nucleic acid encoding SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID 
NO:10, SEQ ID NO:12 or SEQ ID NO:14. 

[12] In some embodiments, the cell is selected from the group consisting of 
an adipocyte and a skeletal muscle cell. 

[13] In some embodiments, the methods further comprising mtroducing the 
20 cell into a hmnan. In some embodiments, the human is diabetic. In some embodiments, the 
human is prediabetic. In some embodiments, the cell is from the human. 

[14] The present invention also provides methods of diagnosing an 
individual who has Type 2 diabetes or is prediabetic. In some embodiments, the method 
comprises, detecting in a sample from the individual the level of a polypeptide or the level of 
25 a polynucleotide encoding the polypeptide, wherein the polynucleotide hybridizes imder 
stringent conditions to a nucleic acid encoding an amino acid sequence selected from the 
group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID 
NO: 10, SEQ ID NO: 12 or SEQ ID NO: 14, wherein a modulated level of the polypeptide or 
polynucleotide in the sample compared to a level of the polypeptide or polynucleotide ia 
30 either a lean individual or a previous sample from the individual indicates that the individual 
is diabetic or prediabetic. 

[15] In some embodiments, the detecting step comprises contacting the 
sample with an antibody that specifically binds to the polypeptide. . 
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[16] ERome embodiments, the detecting step^Bfprises quantifying 
mRNA encoding the polypeptide. In some embodiments, the niRNA is reverse transcribed 
and amplified in a polymerase chain reaction. 

[17] In some embodiments, the sample is a blood, urine or tissue sample. 
5 [18] The present invention also provides isolated nucleic acids that 

hybridize under stringent conditions to a polynucleotide encoding a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID N0:2, SEQ ID NO:4, 
SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID N0:12 or SEQ ID NO:14. 

[19] In some embodiments, the nucleic acid is selected from the group 
10 consistmg of SEQ ID NO:l, SEQ ID Np:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, 
SEQ ID NO:ll or SEQ ID NO:13 

[20] The present invention also provides an expression cassette comprising 
a heterologous promoter operably linked to a polynucleotide that hybridizes und^ stringent 
conditions to a nucleic acid encoding a polypeptide having an amino acid sequence selected 
15 from the group consistmg of SEQ ID NO:2, SEQ ED NO:4, SEQ ID NO:6, SEQ ID NO:8, 
SEQ ID NO:10, SEQ ID N0:12 or SEQ ID NO:14. 

[21] The present invention also provides host cells transfected with a 
polynucleotide that hybridizes under stringent conditions to a nucleic acid encoding a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID 
20 NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12 or SEQ 
ID NO: 14. In some embodiments, the host cell is a human cell. In some embodiments, the 
host cell is a bacterium. 

[22] The present invention also provides isolated polypeptides comprising 
an amino acid sequence at least 70% identical to SEQ ID NO: SEQ ID NO:2, SEQ ID NO:4, 
25 SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12 or SEQ ID NO:14. In some 
embodiments, tiie polypeptide is SEQ ID NO: SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12 or SEQ ID NO:14. 

DEFINITIONS 

30 [23] "Insulin sensitivity" refers to the ability of a cell or tissue to respond to 

insulin. Responses include, e.g., glucose uptake of a cell or tissue in response to insulin 
stimulation. Sensitivity can be determined at an organismal, tissue or cellular level. For 
example, blood or urine glucose levels following a glucose tolerance test are indicative of 
insulin sensitivity. Other methods of measuring insulin sensitivity include, e.g., measuring 
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glucose uptake (see, e.^^^arcia de Herreros, A., and Bimbaum^^J. J. Biol Chem. 264, 
19994-19999 (1989); KUp, A., Li, G., and Lx)gan, WJ- Am. X Physiol 247, E291-296 
(1984)), measuring the glucose infusion rate (GINF) into tissue such as the skeletal muscle 
(see, e,g„ Ludvik et a/., J. Clin, Invest. 100:2354 (1997); Frias et al. Diabetes Care 23:64, 
(2000)) and measuring sensitivity of GLUT4 translocation (e.g., as described herein) in 
response to insulin. 

[24] "Activity" of a polypeptide of the invention refers to structural, 
regulatory, or biochemical functions of a polypeptide in its native cell or tissue. Examples of 
activity of a polypeptide include both direct activities and indirect activities. Exemplary 
direct activities are the result of direct interaction with the polypeptide, , e.g., enzymatic 
activity, ligand binding, production or depletion of second messengers (e.^., cAMP, cGMP, 
IP3, DAG, or Ca^"^, ion flux, phosphorylation levels, transcription levels, and tiie like. 
Exemplary indirect activities are observed as a change in phenotype or response in a cell or 
tissue to a polypeptide's directed activity, e.g., modulating insulm sensitivity of a cell as a 
result of the interaction of the polypeptide with other cellular or tissue components. 

[25] *Tredisposition for diabetes" occurs in a person when the person is at 
high risk for developing diabetes. A number of risk factors are known to those of skill in the 
art and include: genetic factors (e.g., carrying alleles that result in a higher occurrence of 
diabetes than in the average population or having parents or siblings with diabetes); 
overweight (e.g., body mass mdex (BMI) greater or equal to 25 kg/m^); habitual physical 
inactivity, race/ethnicity (e.g., African-American, Hispanic-American, Native Americans, 
Asian-Americans, Pacific Islanders); previously identified impaired fasting glucose or 
impaired glucose tolerance, hypertension (e.g., greater or equal to 140/90 mmHg in adults); 
HDL cholesterol less than or equal to 35 mg/dl; triglyceride levels greater or equal to 250 
mg/dl; a history of gestational diabetes or delivery of a baby over nine pounds; and/or 
polycystic ovary syndrome. See^ e.g., "Report of the Expert Cormnittee on the Diagnosis and 
Classification of Diabetes Mellitus" and "Screening for Diabetes" Diabetes Care 25(1): S5- 
S24 (2002). 

[26] A "lean individual," when used to compare with a sample from a 
patient, refers to an adult with a fastrag blood glucose level less than 110 mg/dl or a 2 hour 
PG reading of 140 mg/dl. "Fasting"refers to no caloric intake for at least 8 hours. A **2 hour 
PG" refers to the level of blood glucose after challenging a patient to a glucose load 
containing the equivalent of 75g anhydrous glucose dissolved in water. The overall test is 
generally referred to as an oral glucose tolerance test (OGTT). See, e.g.. Diabetes Care, 
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Supplement 2002, AmoRan Diabetes Association: Clinical Pral^P Reconraiendations 2002. 
The level of a polypeptide in a lean individual can be a reading from a single individual, but 
is typically a statistically relevant average from a group of lean individuals. The level of a 
polypeptide in a lean individual can be represented by a value, for example in a computer 
5 program. 

[27] A "pre-diabetic individual," when used to compare with a sample from 
a patient, refers to an adult with a fasting blood glucose level greater than 110 mg/dl but less 
than 126 mg/dl or a 2 hour PG reading of greater than 140 mg/dl but less than 200mg/dl. A 
"diabetic individual," whca used to compare Avith a sample from a patient, refers to an adult 
10 with a fasting blood glucose level greater than 126 mg/dl or a 2 hour PG reading of greater 
than 200 mg/dl. 

[28] A "diabetes-related nucleic acid" or "diabetes-related polynucleotide" 
(also referred to as a "nucleic acid of the invention" or a "polynucleotide of the invention") of 
the invention is a subsequence or ftill-length polynucleotide sequence of a gene that encodes 

15 a polypeptide, whose activity modulates diabetes or insulin sensitivity, or whose presence or 
absence is indicative of diabetes or altered insulin sensitivity. Exemplary nucleic acids of the 
invention include those sequences substantially identical to SEQ ID NO:l, SEQ ID NO:3, 
SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO; 11 or SEQ ID NO: 13 or encode 
polypeptides substantially identical to SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID 

20 NO:8, SEQ ID NO:10, SEQ ID NO:12 or SEQ ID NO:14. 

[29] An "agonist" refers to an agent that binds to, stimulates, increases, 
activates, facilitates, enhances activation, sensitizes or up regulates the activity or expression 
of a polypeptide of the invention. 

[30] An "antagonist refers to an agent that binds to, partially or totally 

25 blocks stimulation, decreases, prevents, delays activation, inactivates, desensitizes, or down 
regulates the activity or expression of a polypeptide of the invention. 

[31] "Antibody" refers to a polypeptide substantially encoded by an 
immunoglobulin gene or immunoglobulin genes, or fragments thereof which specifically bind 
and recognize an analyte (antigen). The recognized inmiimoglobuUn genes include the 

30 kappa, lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well as the 

myriad immunoglobulin variable region genes. Light chains are classified as either kappa or 
lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn 
define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively. 
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[32] JB^xemplary immunoj 



7 

iglobulin (antibodj 




LCtural unit comprises a 



tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus 
of each chain defines a variable region of about 100 to 1 10 or more amino acids primarily 
5 responsible for antigen recognition. The terms variable Ught chain (Vl) and variable heavy 
chain (Vh) refer to these light and heavy chains respectively. 



well-characterized fragments produced by digestion with varioxis peptidases. Thus, for 
example, pepsin digests an antibody below the disulfide linkages in the hinge region to 

1 0 produce F(ab)'2, a dimer of Fab which itself is a light cham joined to Vh-Ch1 by a disulfide 
bond. The F(ab) 2 may be reduced under mild conditions to break the disulfide linkage in the 
hinge region, thereby converting tite F(ab) 2 dimer into an Fab' monomer. The Fab* monomer 
is essentially an Fab with part of the hinge region {see, Paul (Ed.) Fundamental Immunology^ 
Third Edition, Raven Press, NY (1993)). While various antibody fragments are defined in 

1 5 terms of the digestion of an intact antibody, one of skill will appreciate that such fragments 
may be synthesized de novo either chemically or by utilizing recombinant DNA 
methodology. Thus, the term antibody, as used herein, also includes antibody fragments 
either produced by the modification of whole antibodies or those synthesized de novo using 
recombinant DNA methodologies (e.g., single chain Fv). 

20 [34] The terms "peptidomimetic" and "mimetic" refer to a synthetic 

chemical compound that has substantially the same structural and fimctional characteristics of 
the antagonists or agonists of the invention. Peptide analogs are commonly used in the 
pharmaceutical industry as non-peptide dmgs with properties analogous to those of the 
template peptide. These types of non-peptide compound are termed "peptide mimetics" or 

25 "peptidomimetics" (Fauchere, J. Adv. Drug Res. 15:29 (1986); Veber and Freidinger T/ATiSp. 
392 (1985); and Evans et al J. Med. Chem. 30:1229 (1987), which are incorporated herein by 
reference). Peptide mimetics that are structurally similar to therapeutically usefiil peptides 
may be used to produce an equivalent or enhanced therapeutic or prophylactic effect. 
Generally, peptidomimetics are structurally similar to a paradigm polypeptide (i.e., a 

30 polypeptide that has a biological or pharmacological activity), such as apolypeptide 

exemplified in this appUcation, but have one or more peptide linkages optionally replaced by 
a linkage selected from the group consisting of, e.g., -CH2NH-, -CH2S-, -CH2-CH2-, - 
CH=CH- (cis and trans), -COCH2-, -CH(OH)CH2-, and -CH2SO-. The mimetic can be 
either entirely composed of synthetic, non-natural analogues of amino acids, or, is a chimeric 



[33] Antibodies exist, e,g, as intact immunoglobulins or as a number of 
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molecule of partly nal 



p^tide amino acids and partly non-ri 




analogs 



of amino acids. 



The mimetic can also incorporate any amomit of natural amino acid conservative 
substitutions as long as such substitutions also do not substantially alter the mimetic's 
structure and/or activity. For example, a mimetic composition is within the scope of the 
5 invention if it is capable of carrying out the binding or other activities of an agonist or 
antagonist of a polypeptide of the invention. 



polypeptide chain; it includes regions preceding and following the coding region (leader and 
trailer) as well as intervening sequences (introns) between individual coding segments 



that the nucleic acid or protein is essentially free of other cellular components with which it is 
associated in the natural state. It is preferably in a homogeneous state although it can be in 
either a dry or aqueous solution. Purity and homogeneity are typically detemiined using 

15 analytical chemistry techniques such as polyacrylamide gel electrophoresis or high 

performance liquid chromatography. A protein that is the predominant species present in a 
preparation is substantially purified. In particular, an isolated gene is separated from open 
reading frames that flank the gene and encode a protein other than the gene of interest. The 
term "purified" denotes that a nucleic acid or protein gives rise to essentially one band in an 

20 electrophoretic gel. Particularly, it means that the nucleic acid or protein is at least 85% pure, 
more preferably at least 95% pure, and most preferably at least 99% pure. 



deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double- 
stranded form. Unless specifically limited, the term encompasses nucleic acids containing 

25 known analogues of natural nucleotides that have similar binding properties as the reference 
nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides. 
Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses 
conservatively modified variants thereof (e.g., degenerate codon substitutions) and 
complementary sequences as well as the sequence explicitly indicated. Specifically, 

30 degenerate codon substitutions may be achieved by generating sequences in which the third 
position of one or more selected (or all) codons is substituted with mixed-base and/or 
deoxyinosine residues (Batzer a/.. Nucleic Acid Res. 19:5081 (1991); Ohtsuka etaL^J, 
Biol Chem. 260:2605-2608 (1985); and Cassol et al (1992); Rossolini et al, Mol Cell 



[35] The term "gene" means the segment of DNA involved in producing a 



10 



(exons). 



[36] The tCTm "isolated," when applied to a nucleic acid or protein, denotes 



[37] The term "nucleic acid" or ^^polynucleotide" refers to 
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Probes 8:91-98 (1994)^^e term nucleic acid is used interchalljlli)ly with gene, cDNA, 
and mRNA encoded by a gene. 

[38] The terms "polypeptide," "peptide" and '"protein" are used 
interchangeably herein to refer to a polymer of amino acid residues. The terms apply to 
5 amino acid polymers in which one or more amino acid residue is an artificial chemical 

mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring 
amino acid polymers and non-naturally occurring amino acid polymers. As used herein, the 
terms encompass andno acid chains of any length, including full-length proteins (i.e., 
antigens), wherein the amino acid residues are linked by covalent peptide bonds. 

1 0 [39] The term "amino acid" refers to naturally occurring and synthetic 

wohio acids, as well as amino acid analogs and amino acid mimetics that function in a 
manner similar to the naturally occurring amino acids. Naturally occurring amino acids are 
those encoded by the genetic code, as well as those amino acids that are later modified, e.g.^ 
hydroxyproline, 7-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to 

1 5 compoxmds that have the same basic chemical structure as a naturally occurring amino acid, 
i.e., an a. carbon that is boimd to a hydrogen, a carboxyl group, an amino group, and an R 
group, e.^., homoserine, norleucine, methionine sulfoxide, methionine methyl sulfoniimi. 
Such analogs have modified R groups (e.g., norleucine) or modified peptide backbones, but 
retain the same basic chemical structure as a naturally occurring amino acid. "Amino acid 

20 mimetics" refers to chemical compounds that have a stracture that is different firom flie 
general chemical structure of an amino acid, but which fimctions in a manner similar to a 
naturally occurring amino acid. 

[40] Amino acids may be referred to herein by either the commonly known 
three letter symbols or by the one-letter symbols recommended by the lUP AC-IUB 

25 Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their 
commonly accepted single-letter codes. 

[41] "Conservatively modified variants" applies to both amino acid and 
nucleic acid sequences. With respect to particular nucleic acid sequences, "conservatively 
modified variants" refers to those nucleic acids that encode identical or essentially identical 

30 amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical sequences. Because of the degeneracy of the genetic code, a large 
nmnber of functionally identical nucleic acids encode any given protein. For instance, the 
codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every 
position where an alanine is specified by a codon, the codon can be altered to any of the 
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corresponding codons o^raibed without altering the encoded pll^tide. Such nucleic acid 
variations are "silent variations," which are one species of conservatively modified 
variations. Every nucleic acid sequence herein that encodes a polypeptide also describes 
every possible silent variation of the nucleic acid. One of skill will recognize that each codon 

5 in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, 
which is ordinarily the only codon for tryptophan) can be modified to yield a functionally 
identical molecule. Accordingly, each silent variation of a nucleic acid that encodes a 
polypeptide is implicit in each described sequence. 

[42] As to amino acid sequences, one of skill will recognize that individual 

10 substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 

sequence which alters, adds or deletes a smgle amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
Conservative substitution tables providing functionally similar amino acids are well known in 

15 the art. Such conservatively modified variants are in addition to and do not exclude 
polymorphic variants, interspecies homologs, and alleles of the invention. 

[43] The following eight groups each contain amino acids that are 
conservative substitutions for one another: 
1) Alanine (A), Glycine (G); 

20 2) Aspartic acid (D), Glutamic acid (E); 

3) Asparagine (N), Glutamine (Q); 

4) Arginine (R), Lysine (K); 

5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 

6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 
25 7) Serine (S), Threonine (T); and 

8) Cysteine (C), Methionine (M) 
{see, e,g,, Creighton, Proteins (1984)). 

[44] "Percentage of sequence identity*' is determined by comparing two 
optimally aligned sequences over a comparison window, wherein the portion of the 
30 polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., 
gaps) as compared to the reference sequence (e.g., a polypeptide of the invention), which 
does not comprise additions or deletions, for optimal aligmnent of the two sequences. The 
percentage is calculated by determining the nxmiber of positions at which the identical nucleic 
acid base or amino acid residue occurs in both sequences to yield the number of matched 
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positions, dividing the SBtoer of matched positions by the total^Pb)er of positions in the 
window of comparison and multiplying the result by 100 to yield the percentage of sequence 
identity. 

[45] The terms "identical" or percent "identity," in the context of two or 
more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences 
that are the same sequences are substantially identical if two sequences have a specified 
percentage of amino acid residues or nucleotides that are the same (i.e., 60% identity, 
optionally 65%, 70%, 75%, 80%, 85%, 90%, or 95% identity over a specified region, or, 
when not specified, over the entire sequence), when compared and aUgned for maximima 
correspondence over a comparison window, or designated region as measured using one of 
the following sequence comparison algorithms or by manual alignment and visual inspection. 
The invention provides polypeptides or polynucleotides that are substantially identical to the 
polypeptides or polynucleotides, respectively, exemplified herein (e.g., SEQ ID NO:l, SEQ 
ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID N0:6, SEQ ID NO:7, SEQ 
ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:ll, SEQ ID NO:12 , SEQ ID NO:13 
or SEQ ID NO: 14). This definition also refers to the complement of a test sequence. 
Optionally, the identity exists over a region that is at least about 50 nucleotides in length, or 
more preferably over a region that is 100 to 500 or 1000 or more nucleotides in length. 

[461 For sequence comparison, typically one sequence acts as a reference 
sequence, to which test sequences are compared. When using a sequence comparison 
algorithm, test and reference sequences are entered into a computer, subsequence coordinates 
are designated, if necessary, and sequence algorithm program parameters are designated. 
Default program parameters can be used, or altemative parameters can be designated. The 
sequence comparison algorithm tiien calculates the percent sequence identities for the test 
sequences relative to the reference sequence, based on the program parameters. 

[47] A "comparison window", as used herein, includes reference to a 
segment of any one of the number of contiguous positions selected from the group consisting 
of firom 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in 
which a sequence may be compared to a reference sequence of the same number of 
contiguous positions after the two sequences are optimally aligned. Methods of alignment of 
sequences for comparison are well known in the art. Optimal aUgnment of sequences for 
comparison can be conducted, e.g., by the local homology algorithm of Smith and Waterman 
(1970) Adv. Appl. Math. 2:482c, by the homology aUgmnent algorithm of Needleman and 
Wunsch (1970) J. Mol Biol 48:443, by the search for similarity method of Pearson and 
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Lipman (1988) Proc. ^^.Acad Set USA 85:2444, by compu^Bld implementations of 
these algorithms (GAP, BESTFTT, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group, 575 Science Dr., Madison, WT), or by manual 
alignment and visual inspection (see, e.g., Ausubel et aL, Current Protocols in Molecular 
Biology (1995 supplement)). 

[48] Two examples of algorithms that are suitable for determining percent 
sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which 
are described in Altschul et al (1977) Nuc. Acids Res. 25:3389-3402, and Altschul et al. 
(1990) J. Mol Biol 215:403-410, respectively. Software for performing BLAST analyses is 
publicly available through the National Center for Biotechnology Infomiation 
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring 
sequence pairs (HSPs) by identifying short words of length W in the query sequence, which 
either match or satisfy some positive-valued threshold score T when aligned with a word of 
the same length in a database sequence. T is referred to as the neighborhood word score 
threshold (Altschul et al, supra). These initial neighborhood word hits act as seeds for 
initiating searches to find longer HSPs containing them. The word hits are extended in both 
directions along each sequence for as far as the cumulative alignment score can be increased. 
Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward 
score for a pair of matching residues; always > 0) and N (penalty score for mismatching 
residues; always < 0). For amino acid sequences, a scoring matrix is used to calculate the 
cumulative score. Extension of the word hits in each direction are halted when: the 
cumulative alignment score falls off by the quantity X from its maximum achieved value; the 
cumulative score goes to zero or below, due to the accumulation of one or more negative- 
scoring residue alignments; or the CTid of either sequence is reached. The BLAST algoriflim 
parameters W, T, and X detennine the sensitivity and speed of the alignment The BLASTN 
program (for nucleotide sequences) uses as defaults a wordlengfh (W) of 1 1, an expectation 
(E) or 10, M=5, N==-4 and a comparison of both strands. For amino acid sequences, the 
BLASTP program uses as defaults a wordlength of 3, and expectation (E) of 10, and the 
BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl Acad ScL USA 
89:10915) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both 
strands. 

[491 Th® BLAST algorithm also performs a statistical analysis of the 
similarity between two sequences (see, e.g., Karlin and Altschul (1993) Proc. Natl Acad. Scl 
USA 90:5873-5787). One measure of similarity provided by the BLAST algorithm is the 
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»13 
(P^), which provides an indication probability by which a 
match between two nucleotide or amino acid sequences would occur by chance. For 
example, a nucleic acid is considCTed similar to a reference sequence if the smallest sum 
probability in a comparison of the test nucleic acid to the reference nucleic acid is less than 
about 0.2, more preferably less than about 0.01, and most preferably less than about COOL 

[50] An indication that two nucleic acid sequences or polypeptides are 
substantially identical is that the polypeptide encoded by the first nucleic acid is 
iimmmologically cross reactive with the antibodies raised against the polypeptide encoded by 
the second nucleic acid, as described below. Thus, a polypeptide is typically substantially 
identical to a second polypeptide, for example, where the two peptides differ only by 
conservative substitutions. Another indication that two nucleic acid sequences are 
substantially identical is tiiat tiie two molecules or their complements hybridize to each other 
under stringent conditions, as described below. Yet another indication that two nucleic acid 
sequences are substantially identical is that the same primers can be used to ampUfy the 
sequence. 

[51] The phrase "selectively (or specifically) hybridizes to" refers to the 
binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence 
xmder stringent hybridization conditions when that sequence is present in a complex mixture 
(e.g., total cellular or library DNA or RNA). 

[52] The phrase "stringent hybridization conditions" refers to conditions 
under which a probe will hybridize to its target subsequence, typically in a complex mixture 
of nucleic acid, but to no other sequences. Stringent conditions are sequence-dependent and 
Avill be different in different curcumstances. Longer sequences hybridize specifically at 
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 
Tijssen, Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic 
Probes y "Overview of principles of hybridization and the strategy of nucleic acid assays" 
(1993). Generally, stringent conditions are selected to be about 5-10° C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength pH. The Tm is 
the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% 
of the probes complementary to the target hybridize to the target sequence at equilibriimi (as 
the target sequences are present in excess, at Tm, 50% of the probes are occupied at 
equilibrium). Stringent conditions will be those in which the salt concentration is less than 
about 1.0 M soditrai ion, typically about 0.01 to 1.0 M sodium ion concentration (or other 
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g-., 10 to 
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50 nucleotides) and at UP about 60"* C for long probes {e.g., gri0^ than 50 nucleotides). 
Stringent conditions may also be achieved with the addition of destabilizing agents such as 
formamide. For selective or specific hybridization, a positive signal is at least two times 
background, optionally 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as following: 50% formamide, 5X SSC, and 1% SDS, 
incubating at 42^C, or 5X SSC, 1% SDS, incubating at 65°C, with wash in 0.2X SSC, and 
0.1% SDS at 55°C, eO^'C, or 65''C. Such washes can be performed for 5, 15, 30, 60, 120, or 
more minutes. 

[53] Nucleic acids that do not hybridize to each other under stringent 
conditions are still substantially identical if the polypeptides tihat they encode are 
substantially identical. This occurs, for example, when a copy of a nucleic acid is created 
using the maximum codon degeneracy permitted by the genetic code. In such cases, the 
nucleic acids typically hybridize under moderately stringent hybridization conditions. 
Exemplary "moderately stringent hybridization conditions" include a hybridization in a 
buffer of 40% formamide, 1 M NaCl, 1% SDS at 3TC, and a wash in IX SSC at 45''C. Such 
washes can be performed for 5, 15, 30, 60, 120, or more minutes. A positive hybridization is 
at least twice backgroimd. Those of ordinary skill will readily recognize that alternative 
hybridization and wash conditions can be utilized to provide conditions of similar stringency. 

[54] The phrase "a nucleic acid sequence encoding" refers to a nucleic acid 
which contains sequence information for a structural RNA such as rRNA, a tRNA, or the 
primary amino acid sequence of a specific protein or peptide, or a binding site for a trans- 
acting regulatory agent. This phrase specifically encompasses degenerate codons (Le., 
different codons which encode a single amino acid) of the native sequence or sequences that 
may be introduced to conform with codon preference in a specific host cell, 

[55] The term 'Recombinant" when used with reference, e.g., to a cell, or 
nucleic acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has 
been modified by the introduction of a heterologous nucleic acid or protein or the alteration 
of a native nucleic acid or protein, or that the cell is derived firom a cell so modified. Thus, 
for example, recombinant cells express genes that are not found within the native 
(nonrecombinant) form of the cell or express native genes that are otherwise abnormally 
expressed, under-expressed or not expressed at all. 

[56] The term "heterologous" when used with reference to portions of a 
nucleic acid indicates that the nucleic acid comprises two or more subsequences that are not 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
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»15 
duced, having two or more sequenc^^^m unrelated genes 
arranged to make a new functional nucleic acid, e.g.y a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein indicates that the 
protein comprises two or more subsequences that are not found ia the same relationship to 
5 each other in nature (e.g., a fusion protein). 

[57] An "expression vector" is a nucleic acid construct, generated 
recombinantly or synthetically, with a series of specified nucleic acid elements that permit 
transcription of a particular nucleic acid in a host cell. The expression vector can be part of a 
plasmid, vims, or nucleic acid fragment. Typically, the expression vector includes a nucleic 
1 0 acid to be transcribed operably linked to a promoter. 

[581 The phrase "specifically (or selectively) binds to an antibody" or 
"specifically (or selectively) immunoreactive with", when referring to a protein or peptide, 
refers to a binding reaction which is determinative of the presence of the protein in the 
presence of a heterogeneous population of proteins and other biologies. Thus, under 
15 designated immunoassay conditions, the specified antibodies bind to a particular protein and 
do not bind in a significant amoimt to other proteins present in the sample. Specific binding 
to an antibody under such conditions may require an antibody that is selected for its 
specificity for a particular protein. For example, antibodies raised against a protein having an 
amino acid sequence encoded by any of the polynucleotides of the invention can be selected 
20 to obtain antibodies specifically immunoreactive with that protein and not with other 

proteins, except for polymorphic variants. A variety of inmiunoassay formats may be used to 
select antibodies specifically immimoreactive with a particular protein. For example, solid- 
phase ELISA inamunoassays. Western blots, or iimnunohistochemistry are routinely used to 
select monoclonal antibodies specifically immunoreactive with a protein. See, Harlow and 
25 Lane Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, NY (1988) for a 
description of immunoassay formats and conditions that can be used to determine specific 
immimoreactivity. Typically, a specific or selective reaction will be at least twice the 
background signal or noise and more typically more than 10 to 100 times background. 

[59] "Inhibitors," "activators," and "modulators" of expression or of 
30 activity are used to refer to inhibitory, activating, or modulating molecules, respectively, 

identified using in vitro and in vivo assays for expression or activity. Modulators encompass 
e.g-., ligands, agonists, antagonists, and their homologs and mimetics. The term **modulator*' 
includes inhibitors and activators. Inhibitors are agents that, e.g., inhibit expression of a 
polypeptide of the invention or bind to, partially or totally block stimulation, decrease. 
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prevoit, delay activatioflnactivate, desensitize, or down regula^^ activity of a 
polypeptide of tiie invention, e.g., antagonists. Activators are agents that, e.g., induce or 
activate the expression of a polypeptide of the invention or bind to, stunulate, increase, open, 
activate, facilitate, or rahance activation, sensitize or up regulate the activity of a polypeptide 
of the invention, e.g., agonists. Modulators include naturally occurring and synthetic ligands, 
antagonists, agonists, small chemical molecules and the like. Such assays for inhibitors and 
activators include, e.g., applying putative modulator compoimds to cells expressing a 
polypeptide of the invention and then determining the functional effects on a polypeptide of 
the invention activity, as described above. Samples or assays comprising a polypeptide of the 
invention that are treated with a potential activator, inhibitor, or modulator are compared to 
control samples without the inhibitor, activator, or modulator to examine the extent of effect. 
Control samples (untreated with modulators) are assigned a relative activity value of 100%. 
Inhibition of a polypeptide of the invOTtion is achieved when the polypeptide activity value 
relative to the control is about 80%, optionally 50% or 25, 10%, 5% or 1%. Activation of the 
polypeptide is achieved when the polypeptide activity value relative to the control is 110%, 
optionally 150%, optionally 200, 300%, 400%, 500%, or 1000-3000% or more higher. 

DETAILED DESCRIPTION OF THE INVENTION 

L INTRODUCTION 

[60] The present invention demonstrates that, surprisingly, modulated levels 
of mRNA comprising sequences of the invention occur in muscle from diabetic individuals 
treated with thiazolidinedione (TZD) compared to levels of the mRNA m the individuals 
before TZD treatment. Without intending to limit tiie invention to a particular mechanism of 
action, it is believed that modulation of the expression or activity of the polypeptides or 
polynucleotides of the uivention is beneficial in treating diabetic, pre-diabetic or obese 
insuhn resistant, non-diabetic patients. Furthermore, modulated levels of the polypeptides of 
the invention are indicative of insulm resistance. Thus, the detection of a polypeptide of the 
invention is useful for diagnosis of diabetes and insulin resistance. 

[61] This invention also provides methods of using polypeptides of the 
invention and modulators of the polypeptides of the invention to diagnose and treat diabetes, 
pre-diabetes (including insulin resistant individuals) and related metabolic diseases. The 
present method also provides methods of identifying modulators of expression or activity of 
the polypeptides of the invention. Such modulators are useful for treating Type 2 diabetes as 
well as the patiiological aspects of diabetes (e.g., insulin resistance). 



wo 03/102161 PCTAJS03/17725 

n. GENERAL Rj^TOMBINANT NUCLEIC ACID ME'l^DS FOR USE 
WITH THE INVENTION 

[62] In numerous embodiments of the present invention, nucleic acids 
encoding a polypq>tide of the present invention will be isolated and cloned using 
recombinant methods. Such embodiments are used, e.g., to isolate polynucleotides identical 
or substantially identical to SEQ ID NO:l, SEQ ID N0:3, SEQ ID NO:5, SEQ ID NO:7, 
SEQ ID NO:9, SEQ ID NO:l 1 or SEQ ID N0:13 for protein expression or during the 
generation of variants, derivatives, expression cassettes, or other sequences derived from an 
polypeptide or polynucleotide of the invention, to monitor gene expression, for the isolation 
or detection of sequences in different species, for diagnostic purposes in a patient, e.g., to 
detect mutations in a polypeptide or polynucleotide of the invention or to detect expression 
levels of nucleic acids or polypeptides. In some embodiments, the sequences encoding the 
polypeptides of the invention are operably linked to a heterologous promoter. In one 
embodiment, the nucleic acids of the invention are from any mammal, including, in 
particular, e.g., a human, a mouse, a rat, etc. 

A. General Recombinant Nucleic Acid Methods 

[63] This invention relies on routine techniques in the field of recombinant 
genetics. Basic texts disclosing the general methods of use in this invention include 
Sambrook et al. Molecular Cloning, A Laboratory Manual (3rd ed. 2001); Kriegler, Gene 
Transfer and Expression: A Laboratory Manual (1990); and Current Protocols in Molecular 
Biology (Ausubel et al., eds., 1994)). 

[64] For nucleic acids, sizes are given in either kilobases (kb) or base pairs 
(bp). These are estimates derived from agarose or acrylamide gel electrophoresis, from 
sequenced nucleic acids, or from published DNA sequences. For proteins, sizes are givra in 
kilodaltons QdDa) or amino acid residue numbers. Proteins sizes are estimated from gel 
electrophoresis, from sequenced protems, from derived amino acid sequences, or from 
published protein sequences. 

[65] Oligonucleotides that are not commCTcially available can be 
chemically synthesized according to the soUd phase phosphoramidite triester method first 
described by Beaucage & Caruthers, Tetrahedron Letts. 22:1859-1862 (1981), using an 
automated synthesizer, as described in Van Devanter et. al. Nucleic Acids Res. 12:6159-6168 
(1984). Purification of oligonucleotides is by either native acrylamide gel electrophoresis or 
by anion-exchange HPLC as described in Pearson & Reanier, J. Chrom. 255:137-149 (1983). 
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[66] flRequence of the cloned genes and sy^^l^c oligonucleotides can 
be verified after cloning using, e.g., the chain termination method for sequencing double- 
stranded templates of Wallace et al. Gene 16:21-26 (1981). 

B. Cloning Methods for the Isolation of Nucleotide Sequences Encoding 
Desired Proteins 

[67] In general, the nucleic acids encoding the subject proteins are cloned 
from DNA sequence libraries that are made to encode cDNA or genomic DNA. The 
particular sequences can be located by hybridizing with an oligonucleotide probe, the 
sequence of which can be derived from the sequences disclosed herein, which provide a 
reference for PGR primers and defines suitable regions for isolating probes pecific for the 
polypeptides or polynucleotides of the invention. Alternatively, where the sequence is cloned 
into an expression Ubrary, the expressed recombinant protein can be detected 
immimologically with antisera or purified antibodies made against a polypeptide of mterest, 
including those disclosed herein. 

[68] Methods for making and screening genomic and cDNA libraries are 
well known to those of skill m the art {see, e.g., Gubler and Hoffinan Gene 25:263-269 
(1983); Benton and Davis Science, 196:180-182 (1977); and Sambrook, supra). 

[69] Briefly, to make the cDNA library, one should choose a source that is 
rich in mRNA. The mRNA can then be made into cDNA, Ugated into a recombinant vector, 
and transfected into a recombinant host for propagation, screening and cloning. For a 
genomic library, the DNA is extracted from a suitable tissue and either mechanically sheared 
or enzymatically digested to yield fragments of preferably about 5-100 kb. The fragments are 
then separated by gradient centrifiigation from undesired sizes and are constmcted in 
bacteriophage lambda vectors. These vectors and phage are packaged in vitro, and the 
recombinant phages are analyzed by plaque hybridization. Colony hybridization is carried 
out as generally described in Grunstein et al., Proc. Natl Acad. ScL USA., 72:3961-3965 
(1975). 

[70] An alternative method combines the use of synthetic oUgonucleotide 
primers with polymerase extension on an n:iRNA or DNA template. Suitable primers can be 
designed from specific sequences disclosed herein. This polymerase chain reaction (PGR) 
method amplifies the nucleic acids encoding the protein of interest directly from noRNA, 
cDNA, genomic libraries or cDNA Ubraries. Restriction endonuclease sites can be 
incorporated into the primers. Polymerase chain reaction or other in vitro amplification 
methods may also be useftil, for example, to clone nucleic acids encoding specific proteins 
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and express said proteiSR synthesize nucleic acids that will b4|ld as probes for detecting 
the presence of mRNA encoding a polypeptide of the invention in physiological samples, for 
nucleic acid sequencing, or for other purposes {see^ U.S. Patent Nos. 4,683,195 and 
4,683,202), Genes amplified by a PGR reaction can be purified from agarose gels and cloned 
into an appropriate vector. 

[711 Appropriate primers and probes for identifying the genes encoding a 
polypeptide of the invention firom mammalian tissues can be derived jfrom the sequences 
provided herein. For a general overview of PGR, see, Imiis et al PCR Protocols: A Guide to 
Methods and Applications, Academic Press, San Diego (1990). 

[72] Synthetic oligonucleotides can be used to construct genes. This is 
done iising a series of overlapping oligonucleotides, usually 40-120 bp in length, representing 
both the sense and anti-sense strands of the gene. These DNA fiagments are then annealed, 

ligated and cloned. 

[73] A polynucleotide encoding a polypeptide of the invention can be 
cloned using intermediate vectors before transformation into mammalian cells for expression. 
These intermediate vectors are typically prokaryote vectors or shuttle vectors. The proteins 
can be expressed in either prokaryotes or eukaryotes, using standard methods well known to 
those of skill in the art. 

in. PURIFICATION OF PROTEINS OF THE INVENTION 

[74] Either naturally occurring or recombinant polypeptides of the invention 
can be purified for use in functional assays. Naturally occurring polypeptides of the 
invention can be purified from any source (e.g., tissues of an organism expressing an 
ortholog). Recombinant polypeptides can be purified firom any suitable expression system. 

[75] The polypeptides of the invention may be purified to substantial purity 
by standard techniques, including selective precipitation with such substances as ammonium 
sulfate; column chromatography, immunopurification metiiods, and others {see, e.g.. Scopes, 
Protein Purification: Principles and Practice (1982); U.S. Patent No. 4,673,641; Ausubel et 
al, supra; and Sambrook et al, supra). 

[76] A number of procedures can be employed when recombinant 
polypeptides are being purified. For example, proteins having established molecular 
adhesion properties can be reversibly fiised to a polypeptide of the invention. With the 
appropriate ligand, either protein can be selectively adsorbed to a pxirification colimm and 
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then freed from the coliSm in a relatively pure form. The fused^^ein may be then removed 
by enzymatic activity. Finally polypeptides can be purified using immunoafOnity colunans. 

A. Purification of Proteins from Recombinant Bacteria 

[77] When recombinant proteins are expressed by the transfomied bacteria 
in large amounts, typically after promoter induction, although expression can be constitutive, 
the proteins may form insoluble aggregates. There are several protocols that are suitable for 
purification of protein inclusion bodies. For example, purification of aggregate proteins 
(hereinaftCT referred to as inclusion bodies) typically involves the extraction, separation 
and/or purification of inclusion bodies by disraption of bacterial cells typically, but not 
limited to, by incubation in a buffer of about 100-150 jxg/ml lysozyme and 0.1% Nonidet 
P40, a non-ionic detergent. The cell suspension can be ground using a Polytron grinder 
(Brinkman Instruments, Westbury, NY). Alternatively, the cells can be sonicated on ice. 
Altemate methods of lysing bacteria are described in Ausubel et al and Sambrook et al, both 
supra, and will be apparent to those of skill in the art. 

[781 The cell suspension is generally centriftiged and the pellet containing 
the inclusion bodies resuspended in buffer which does not dissolve but washes the inclusion 
bodies, e.g., 20 mM Tris-HCl (pH 7.2), 1 mM EDTA, 150 mM NaCl and 2% Triton-X 100, a 
non-ionic detergent. It may be necessary to repeat the wash step to remove as much cellular 
debris as possible. The remaining pellet of inclusion bodies may be resuspended in an 
appropriate buffer (e.g., 20 mM sodium phosphate, pH 6.8, 150 mM NaCl). Other 
appropriate buffers will be apparent to those of skill in the art, 

[79] Following the washing step, the inclusion bodies are solubilized by the 
addition of a solvent that is both a strong hydrogen acceptor and a strong hydrogen donor (or 
a combination of solvents each having one of these properties). The proteins that formed the 
inclusion bodies may then be renatxured by dilution or dialysis with a compatible buffer. 
Suitable solvents include, but are not limited to, urea (from about 4 M to about 8 M), 
formamide (at least about 80%, volume/volume basis), and guanidine hydrochloride (from 
about 4 M to about 8 M). Some solvents that are capable of solubilizing aggregate-forming 
proteins, such as SDS (sodixun dodecyl sulfate) and 70% formic acid, are inappropriate for 
use in this procedure due to the possibility of irreversible denaturation of the proteins, 
accompanied by a lack of immxmogenicity and/or activity. Although guanidine 
hydrochloride and similar agents are denaturants, this denaturation is not irreversible and 
renaturation may occur upon removal (by dialysis, for example) or dilution of the denaturant. 
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allowing re-fonnation immunologically and/or biological^^ive protein of interest 
After solubilization, the protein can be separated from other bacterial proteins by standard 

separation techniques. 

[80] Altematively, it is possible to purify proteins from bacteria periplasm. 
Where the protein is exported into the periplasm of the bacteria, the periplasmic fraction of 
the bacteria can be isolated by cold osmotic shock in addition to other methods known to 
those of skill in the art (see, Ausubel et al, supra). To isolate recombinant proteins from the 
periplasm, the bacterial cells are centriftiged to form a pellet. The pellet is resuspended in a 
buffer containing 20% sucrose. To lyse the cells, the bacteria are centrifuged and the pellet is 
resuspended in ice-cold 5 mM MgS04 and kept in an ice bath for approximately 10 minutes. 
The cell suspaision is centriftiged and the supernatant decanted and saved. The recombinant 
proteins present in the supernatant can be separated from the host proteins by standard 
separation techniques well known to those of skill in the art. 



B. Purification of Proteins from Insect Cells 

[81] Proteins can also be pvuified from eukaryotic gene expression systems 
as described in, e.g., Fernandez and Hoeffler, Gene Expression Systems (1999). In some 
embodiments, baculovirus expression s>«tems are used to isolate protems of the invention. 
Recombinant baculoviruses are generally generated by replacing the polyhedrin coding 
sequence of a baculovirus with a gene to be expressed (e.g., raicoding a polypeptide of the 
invention). Viruses lacking the polj4iedrin gene have a unique plaque morphology making 
them easy to recognize. In some embodiments, a recombinant baculovirus is generated by 
first cloning a polynucleotide of interest into a transf«: vector (e.g., a pUC based vector) such 
that the polynucleotide is operably linked to a polyhedrin promoter. The transfer vector is 
transfected with wildtype DNA into an insect cell (e.g., Sf9, S£21 or BT1-TN-5B1-4 cells), 
resulting in homologous recombination and replacement of the polyhedrin gene in the 
wildtype viral DNA with the polynucleotide of interest. Virus can then be generated and 
plaque purified. Protein expression results upon viral infection of insect cells. Expressed 
proteins can be harvested from cell supernatant if secreted, or from cell lysates if 
intracellular. See, e,g., Ausubel et al. and Fernandez and Hoeffler, supra. 
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C. StandanRrotein Separation Techniques For i^llfying Proteins 

1. SolubUitv Fractionation 

[82] Often as an initial step, and if the protein mixture is complex, an initial 
salt fractionation can separate many of the unwanted host cell proteins (or proteins derived 

5 from the cell culture media) from the recombinant protein of interest. The preferred salt is 
ammonimn sulfate. Ammonium sulfate precipitates proteins by effectively reducing the 
amount of water in the protein mixture. Proteins then precipitate on the basis of their 
solubility. The more hydrophobic a protein is, the more likely it is to precipitate at lower 
ammonium sulfate concentrations. A typical protocol is to add saturated ammonium sulfate 

10 to a protein solution so that the resultant ammonium sulfate concentration is between 20- 
30%. This will precipitate the most hydrophobic proteins. The precipitate is discarded 
(unless the protein of interest is hydrophobic) and ammonium sulfate is added to the 
supernatant to a concentration known to precipitate the protein of interest. The precipitate is 
then solubiUzed in buffer and tihie excess salt removed if necessary, through either dialysis or 

1 5 diafiltration. Other methods that rely on solubility of proteins, such as cold ethanol 
precipitation, are well known to those of skill in the art and can be used to fractionate 
complex protein mixtures. 

2. Size Differential Filtration 

[83] Based on a calculated molecular weight, a protein of greater and lesser 
20 size can be isolated using ultrafiltration through membranes of different pore sizes (for 
example, Amicon or Millipore membranes). As a first step, the protein mixture is 
ultrafiltered through a membrane with a pore size that has a lower molecular weight cut-off 
than the molecular weight of the protein of interest. The retentate of the ultrafiltration is then 
ultrafiltered against a membrane with a molecular cut off greater than the molecular weight of 
25 the protein of interest. The recombinant protein will pass through the membrane into the 
filtrate. The filtrate can then be chromatographed as described below. 

3. Colum n Chro matographv 

[841 The proteins of interest can also be separated from other proteins on 
the basis of their size, net surface charge, hydrophobicity and affinity for ligands. In 
30 addition, antibodies raised against proteins can be conjugated to column matrices and the 
proteins immunopurified. All of these methods are well known in the art. 

[85] Lnmunoaffinity chromatography using antibodies raised to a variety of 
affinity tags such as hemagglutinin (HA), FLAG, Xpress, Myc, hexahistidine (His), 
glutathione S transferase (GST) and the like can be used to purify polypeptides. The His tag 
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will also act as a chelaui^gent for certain metals (e.g., Ni) and^^ the metals can also be 
used to pvirify His-containing polypq)tides. After purification, the tag is optionally removed 
by specific proteolytic cleavage. 

[86] It will be apparent to one of skill that chromatographic techniques can 
5 be performed at any scale and using equipment firom many different manufacturers (e,g,, 
Pharmacia Biotech). 

IV. DETECTION OF POLYNUCLEOTIDES OF THE INVENTION 

[87] Those of skill in the art will recognize that detection of expression of 
polynucleotides and polypeptides of the invention has many uses. For example, as discussed 

10 herein, detection of levels of polynucleotides and polypeptides of the invention in a patient is 
useful for diagnosing diabetes or a predisposition for at least some of the pathological effects 
of diabetes. Moreover, detection of gene expression is usefid to idratify modulators of 
expression of polynucleotides and polypeptides of the invention. 

[88] A variety of methods of specific DNA and RNA measurement that use 

15 nucleic acid hybridization techniques are known to those of skill in the art (see, Sambrook, 
supra). Some methods involve an electrophoretic separation {e.g.y Southern blot for detecting 
DNA, and Northem blot for detecting RNA), but measurement of DNA and RNA can also be 
carried out in the absence of electrophoretic separation (e.g., by dot blot). Southem blot of 
genomic DNA (e.g. from a human) can be used for screening for restriction firagment length 

20 polymorphism (RFLP) to detect the presence of a genetic disorder affecting a polypeptide of 
the invention. 

[89] The selection of a nucleic acid hybridization format is not critical, A 
variety of nucleic acid hybridization formats are known to those skilled in the art. For 
example, common formats include sandwich assays and competition or displacement assays. 
25 Hybridization techniques are generally described in Hames and Higgins Nucleic Acid 

Hybridization, A Practical Approach, IRL Press (1985); Gall and Pardue, Proc. Natl Acad. 
Set USA., 63:378-383 (1969); and John et al. Nature, 223:582-587 (1969). 

[90] Detection of a hybridization complex may require the binding of a 
signal-generating complex to a duplex of target and probe polynucleotides or nucleic acids. 
30 Typically, such binding occurs through Ugand and anti-ligand interactions as between a 
ligand-conjugated probe and an anti-ligand conjugated with a signal. The binding of the 
signal generation complex is also readily amenable to accelerations by exposure to ultrasonic 
energy. 
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[911 



label may also allow indirect detectii 




'the hybridization 



complex. For example, where the label is a hapten or antigen, the sample can be detected by 
using antibodies. In these systems, a signal is generated by attaching fluorescent or enzyme 
molecules to the antibodies or in some cases, by attachment to a radioactive label {see^ e.g., 
5 Tijssen, ''Practice and Theory of Enzyme Immunoassays'' Laboratory Techniques in 

Biochemistry and Molecular Biology, Burdon and van Knippenberg Eds., Elsevier (1985), pp 
9-20). 



chromophores, Imniphores, chromogens, or indirectly, such as with biotin, to which a 
10 streptavidin complex may later bind. Thus, flie detectable labels used in the assays of the 

present invention can be primary labels (where the label comprises an element that is detected 
directly or that produces a directly detectable element) or secondary labels (where the 
detected label binds to a primary label, e.g., as is conmion in inMnxmological labeling). 
Typically, labeled signal nucleic acids are used to detect hybridization. Complementary 
15 nucleic acids or signal nucleic acids may be labeled by any one of several mefliods typically 
used to detect the presence of hybridized polynucleotides. The most common method of 
detection is the use of autoradiography with ^H, ^^^I, ^^S, ^"^C, or ^^P-labeled probes or the 
hke. 



20 fluorophores, chemiluminescent agents, enzymes, and antibodies that can serve as specific 

binding pair members for a labeled ligand. An introduction to labels, labehng procedures and 
detection of labels is found in Polak and Van Noorden Introduction to Immunocytochemistry, 
2nd ed.. Springer Verlag, NY (1997); and m Haugland Handbook of Fluorescent Probes and 
Research Chemicals, a combined handbook and catalogue Published by Molecular Probes, 

25 hic. (1996). 



combination is used to detect the detection reagent label. Typical detectors include 
spectrophotometers, phototubes and photodiodes, microscopes, scintillation coxmters, 
cameras, film and the like, as well as combinations thereof Examples of suitable detectors 
30 are widely available from a variety of commercial sources known to persons of skill in the 
art. Commonly, an optical image of a substrate comprising bound labeling moieties is 
digitized for subsequent computer analysis. 



amount of label fixed to the sohd support by binding of the detection reagent. Typically, the 



[92] The probes are typically labeled either directly, as with isotopes. 



[93] Other labels include, e,g. , ligands that bind to labeled antibodies. 



[94] in general, a detector that monitors a particular probe or probe 



[95] The amoimt of, for example, an RNA is measured by quantitating the 
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presence of a modulato^imng incubation will increase or decrel^^e amoimt of label fixed 
to the solid support relative to a control incubation that does not comprise the modulator, or 
as compared to a baseline established for a particular reaction type. Means of detecting and 
quantitating labels are well known to those of skill in the art. 

[96] In some embodiments, the target nucleic acid or the probe is 
immobilized on a solid support. Solid supports suitable for use in the assays of the invention 
are known to those of skill in the art. As used herein, a solid support is a matrix of material 
in a substantially fixed arrangement. 

[97] A variety of automated solid-phase assay techniques are also 
appropriate. For instance, very large scale immobilized polymer arrays (VLSIPS™), i.e. 
Gene Chips or microairays, available from Affymetrix, Inc. in Santa Clara, CA can be used 
to detect changes in expression levels of a plurality of genes involved in the same regulatory 
patihiways simultaneously. See^ Tijssen, supra.^ Fodor et al (1991) Science, 251: 767- 777; 
Sheldon et al (1993) Clinical Chemistry 39(4): 718-719, and Kozal et al (1996) Nature 
Medicine 2(7): 753-759. Similarly, spotted cDNA arrays (arrays of cDNA sequences bound 
to nylon, glass or another solid support) can also be used to monitor expression of a plurality 
of genes. 

[98] Typically, the array elements are organized in an ordered fashion so 
that each element is present at a specified location on the substrate. Because the array 
elements are at specified locations on the substrate, the hybridization pattems and intensities 
(which together create a unique expression profile) can be interpreted in terms of expression 
levels of particular genes and can be correlated with a particular disease or condition or 
treatment See, e.g,, Schena et al. Science 270: 467-470 (1995)) and (Lockhart et al. Nature 
Biotech. 14: 1675-1680 (1996)). 

[99] Hybridization specificity can be evaluated by comparing the 
hybridization of specificity-control polynucleotide sequences to specificity-control 
polynucleotide probes that are added to a sample in a known amount. The specificity-control 
target polynucleotides may have one or more sequence mismatches compared witii the 
corresponding polynucleotide sequences. In this manner, whether only complementary target 
polynucleotides are hybridizing to the polynucleotide sequences or whether mismatched 
hybrid duplexes are forming is determined. 

[100] Hybridization reactions can be performed in absolute or differential 
hybridization formats. In the absolute hybridization format, polynucleotide probes from one 
sample are hybridized to the sequences in a microarray format and signals detected after 
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hybridization complex 



lation correlate to polynucleotide proi 




els in a sample. In the 



differential hybridization format, the differential expression of a set of genes in two 
biological samples is analyzed. For differential hybridization, polynucleotide probes from 
both biological samples are prepared and labeled with different labeling moieties. A mixture 

5 of the two labeled polynucleotide probes is added to a microarray. The microarray is then 
examined under conditions in which the emissions from the two different labels are 
individually detectable. Sequences in the microarray that are hybridized to substantially equal 
nimibers of polynucleotide probes derived from both biological samples give a distinct 
combined fluorescence (Shalon et al PCT pubUcation WO95/35505). In some embodiments, 

10 the labels are fluorescent labels with distinguishable emission spectra, such as Cy3 and Cy5 
fluorophores. 



nucleic acids and complex formation between the hybridizable array elements and the 
polynucleotide probes is detected. Methods for detecting complex formation are well known 
15 to those skilled in the art. In some embodiments, the polynucleotide probes are labeled with a 
fluorescent label and measurement of levels and patterns of fluorescence indicative of 
complex formation is accomphshed by fluorescence microscopy, such as confocal 
fluorescence microscopy. 



20 two or more different biological samples are labeled Avith two or more different fluorescent 
labels with different emission wavelengths. Fluorescent signals are detected separately with 
different photomultipUers set to detect specific wavelengths. The relative 
abundances/expression levels of the polynucleotide probes in two or more samples are 
obtained. 

25 [1 03] Typically, microarray fluorescence intensities can be normalized to 

take into account variations in hybridization intensities when more than one microarray is 
used under similar test conditions. In some embodiments, individual polynucleotide 
probe/target complex hybridization intensities are normalized using tiie intensities derived 
from internal normalization controls contained on each microarray. 

30 [104] Detection of nucleic acids can also be accomplished, for example, by 

using a labeled detection moiety that binds specifically to duplex nucleic acids ie,g,, an 
antibody that is specific for RNA-DNA duplexes). One example uses an antibody that 
recognizes DNA-RNA heteroduplexes in which the antibody is linked to an enzyme 
(typically by recombinant or covalent chemical bonding). The antibody is detected when the 



[1011 After hybridization, the microarray is washed to remove nonhybridized 



[102] In a differential hybridization experiment, polynucleotide probes from 
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enzyme reacts with its ^^rate, producing a detectable product.^^tlee et al (1989) 
Analytical Biochemistry 181:153-162; Bogulavski (1986) et al J. Immunol Methods 89:123- 
130; Prooijen-Knegt (1982) £xp. Cell Res. 141:397-407; Rudkin (1976) JViiftir^ 265:472-473, 
StoUar (1970) PNAS 65:993-1000; Ballard (1982) Mol Immunol 19:793-799; Pisetsky and 
Caster (1982) Mol Immunol 19:645-650; Viscidi et al (1988) J. Clin. Microbial 41:199- 
209; and Kiney et al (1989) J. Clin, Microbiol 27:6-12 describe antibodies to RNA 
duplexes, including homo and heteroduplexes. Kits comprising antibodies specific for 
DNA:RNA hybrids are available, e,g,, firom Digene Diagnostics, Mc. (Beltsville, MD). 

[105] In addition to available antibodies, one of skill in the art can easily 
make antibodies specific for nucleic acid duplexes using existing techniques, or modify those 
antibodies that are commercially or publicly available. In addition to the art referenced 
above, general methods for producing polyclonal and monoclonal antibodies are known to 
those of skill in the art {see, e.g., Paul (ed) Fundamental Immunology, Third Edition Raven 
Press, Ltd,, NY (1993); Coligan Current Protocols in Immunology Wiley/Greene, NY 
(1991); Harlow and Lane Antibodies: A Laboratory Manual Cold Spring Harbor Press, NY 
(1989); Stites et al (eds.) Basic and Clinical Immunology (4th ed.) Lange Medical 
Publications, Los Altos, CA, and references cited ttierein; Goding Monoclonal Antibodies: 
Principles and Practice (2d ed,) Academic Press, New York, NY, (1986); and Kohler and 
Milstein Nature 256: 495-497 (1975)). Other suitable techniques for antibody preparation 
include selection of libraries of recombinant antibodies in phage or similar vectors {see, Huse 
et al Science 246:1275-1281 (1989); and Ward et al Nature 341:544-546 (1989)). Specific 
monoclonal and polyclonal antibodies and antisera will usually bind with a Kd of at least 
about 0.1 hM, preferably at least about 0.01 pM or better, and most typically and preferably, 
0.001 mM or better. 

[106] The nucleic acids used in this invention can be either positive or 
negative probes. Positive probes bind to their targets and the presence of duplex formation is 
evidence of the presence of the target. Negative probes fail to bind to the suspect target and 
the absence of duplex formation is evidence of the presence of the target For example, the 
use of a wild type specific nucleic acid probe or PGR primers may serve as a negative probe 
in an assay sample where only the nucleotide sequence of interest is present. 

[107] The sensitivity of the hybridization assays may be enhanced through 
use of a nucleic acid amplification system that multipUes the target nucleic acid being 
detected. Examples of such systems include the polymerase chain reaction (PGR) system and 
the Ugase chain reaction (LCR) system. Other methods recently described in the art are the 
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nucleic acid sequence mR amplification (NASB A, Cangene, iJ^fcsauga, Ontario) and Q 
Beta Replicase systems. These systems can be used to directly identify mutants where the 
PGR or LCR primers are designed to be extended or ligated only when a selected sequence is 
present. Alternatively, the selected sequences can be generally amplified using, for example, 
nonspecific PGR primers and the amplified target region later probed for a specific sequence 
indicative of a mutation. It is understood that various detection probes, including Taqman 
and molecular beacon probes can be used to monitor ampUfication reaction products, e.g., in 
real time. 

[108] An alternative means for determining the level of expression of the 
nucleic acids of the present invention is in situ hybridization. In situ hybridization assays are 
well known and are generally described m Angerer et al. Methods Enzymol 152:649-660 
(1987). In an in situ hybridization assay, cells, pref«:entially human cells firom the 
cerebellum or the hippocampus, are fixed to a solid support, typically a glass slide. If DNA is 
to be probed, the cells are denatured with heat or alkali. The cells are then contacted with a 
hybridization solution at a moderate temp^ature to permit annealing of specific probes that 
are labeled. The probes are preferably labeled with radioisotopes or fluorescent reporters. 

[109] Single nucleotide polymorphism (SNP) analysis is also usefiil for 
detecting differences between alleles of the polynucleotides (e.g., genes) of the invention. 
SNPs linked to genes encoding polypeptides of the invention are usefiil, for instance, for 
diagnosis of diseases (e.g., diabetes) whose occurrence is linked to the gene sequences of the 
invention. For example, if an individual carries at least one SNP linked to a disease- 
associated allele of the gene sequences of the invention, the individual is likely predisposed 
for one or more of those diseases. If the individual is homozygous for a disease-linked SNP, 
the individual is particularly predisposed for occurrence of that disease (e.g., diabetes). In 
some embodiments, the SNP associated with the gene sequences of the invention is located 
within 300,000; 200,000; 100,000; 75,000; 50,000; or 10,000 base pairs firom the gene 
sequence. 

[110] Various real-time PGR methods including, e.g., Taqman or molecular 
beacon-based assays (e.g., U.S. Patent Nos. 5,210,015; 5,487,972; Tyagi et al. Nature 
Biotechnology 14:303 (1996); and PCT WO 95/13399 are useful to monitor for the presence 
of absence of a SNP. Additional SNP detection methods include, e.g., DNA sequencing, 
sequencing by hybridization, dot blotting, ohgonucleotide array (DNA Chip) hybridization 
analysis, or are described in, e.g., U.S. Patent No. 6,177,249; Landegren et al. Genome 
Research^ 8:769-776 (1998); Botstein et al.. Am J Human Genetics 32:314-331 (1980); 
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Meyers et al. Methods ifrenzymology 155:501-527 (1987); KeiWf al. Trends in Genetics 
7:5 (1991); Myers et al. Science 230:1242-1246 (1985); and Kwok et al. Genomics 23:138- 
144 (1994). 

V. IMMUNOLOGICAL DETECTION OF POLYPEPTIDES OF THE 
INVENTION 

[111] In addition to the detection of polynucloetides of the invention and 
gene expression using nucleic acid hybridization technology, one can also use immunoassays 
to detect polypeptides of the invention. Immunoassays can be used to qualitatively or 
quantitatively analyze polypeptides of the invention. A general overview of the applicable 
technology can be found in Harlow & Lane, Antibodies: A Laboratory Manual (1988). 

A. Antibodies to Target Proteins or other immunogens 

[112] Methods for producing polyclonal and monoclonal antibodies that 
react specifically with a protein of interest or other immunogen are known to those of skill in 
the art {see, e.g., Coligan, supra\ and Harlow and Lane, supra\ Stites et al, supra and 
references cited therem; Coding, supra\ and Kohler and Milstein Nature, 256:495-497 
(1975)). Such techniques include antibody prqparation by selection of antibodies from 
libraries of recombinant antibodies in phage or similar vectors {see^ Huse et aL, supra; and 
Ward et al., supra). For example, in order to produce antisera for use in an immunoassay, the 
protein of interest or an antigenic fragment tiiereof, is isolated as described hereia. For 
example, a recombinant protein is produced in a transformed cell line. An inbred strain of 
mice or rabbits is immxmized with the protein using a standard adjuvant, such as Freund's 
adjuvant, and a standard immunization protocol. Altematively, a synthetic peptide derived 
from the sequences disclosed herein is conjugated to a carrier protein and used as an 
immunogen. 

[113] Polyclonal sera are collected and titered against the immunogen in an 
immunoassay, for example, a solid phase imrnxmoassay with the immxmogen immobilized on 
a solid support. Polyclonal antisera with a titer of 10"* or greater are selected and tested for 
their crossreactivity against proteins other than the polypeptides of the invention or even 
other homologous proteins from other organisms, using a competitive bindmg immunoassay. 
Specific monoclonal and polyclonal antibodies and antisera will usually bind with a Kd of at 
least about 0.1 mM, more usually at least about 1 fiM, preferably at least about 0.1 jjM or 
better, and most preferably, 0.01 \xM or better. 
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[nber of proteim of flie invention cori^Bsing mmimogeM 
used to produce antibodies specifically or selectively reactive witti the proteins of interest. 
Recombinant protein is the preferred immunogen for the production of monoclonal or 
polyclonal antibodies. Naturally occurring protein may also be used either in pure or impure 
form. Synthetic peptides made using the protein sequences described herein may also be 
used as an immunogen for the production of antibodies to the protein. Recombinant protein 
can be expressed in eukaryotic or prokaryotic cells and purified as generally described supra. 
The product is then injected into an animal capable of producing antibodies. Either 
monoclonal or polyclonal antibodies may be generated for subsequent use in immunoassays 
to measure the protein. 

[115] Methods of production of polyclonal antibodies are known to those of 
skill in the art. In brief, an immunogen, preferably a purified protein, is mixed with an 
adjuvant and animals are immunized. The animal's immune response to the immunogen 
preparation is monitored by taking test bleeds and determining the titer of reactivity to 
polypeptides of the invention. When appropriately high titers of antibody to the immunogen 
are obtained, blood is collected from the animal and antisera are prepared. Further 
fractionation of the antisera to enrich for antibodies reactive to the protein can be done if 
desired {see^ Harlow and Lane, supra). 

[1 1 6] Monoclonal antibodies may be obtained using various techniques 
famiUar to those of skill in the art. Typically, spleen cells from an animal unmunized with a 
desired antigen are immortalized, commonly by fixsion with a myeloma cell {see, Kohler and 
Milstein, Eur. J. Immunol 6:511-519 (1976)). Altemative metiiods of immortalization 
include, e.g.^ transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other 
methods well known in the art. Colonies arising from single immortalized cells are screened 
for production of antibodies of the desired specificity and affinity for the antigen, and yield of 
the monoclonal antibodies produced by such cells may be enhanced by various techniques, 
including injection into the peritoneal cavity of a vertebrate host. Alternatively, one may 
isolate DNA sequences that encode a monoclonal antibody or a binding fragment thereof by 
screening a DNA library from hxmian B cells according to the general protocol outlined by 
Huse et aL, supra. 

[117] Once target immimogen-specific antibodies are available, the 
immunogen can be measured by a variety of immimoassay methods with qualitative and 
quantitative results available to the clinician. For a review of immunological and 
immunoassay procedures in general see, Stites, supra. Moreover, the immimoassays of the 
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formed in any of several configurati8l^\! 



present invention can beperformed in any of several configuratil^whicli are reviewed 
extensively in Maggio Enzyme Immunoassay^ CRC Press, Boca Raton, Florida (1980); 
Tijssen, supra; and Harlow and Lane, supra. 

[1 1 8] Immunoassays to measure target proteins in a human sample may use 
a polyclonal antiserum that was raised to fulHength polypeptides of the invention or a 
fragment thereof. This antiserum is selected to have low cross-reactivity against other 
proteins and any such cross-reactivity is removed by immunoabsorption prior to use in the 
immunoassay. 



B. Immunological Binding Assays 

[119] In some embodiments, a protein of interest is detected and/or 
quantified using any of a nmnber of well-known immunological binding assays (see, e.g,, 
U.S. Patents 4,366,241; 4,376,110; 4,517,288; and 4,837,168). For a review of the general 
immimoassays, see also Asai Methods in Cell Biology Volume 37: Antibodies in Cell 
Biology, Academic Press, Inc. NY (1993); Stites, supra. Immunological binding assays (or 
immunoassays) typically utilize a "capture agent" to specifically bind to and often 
immobilize the analyte (e.g., full-length polypeptides of the present invention, or antigenic 
subsequences thereof. The capture agent is a moiety that specifically binds to the analyte. 
The antibody may be produced by any of a nimiber of means well known to those of skill in 
the art and as described above. 

[120] Immunoassays also often utilize a labeling agent to bind specifically to 
and label the binding complex formed by the capture agent and the analyte. The labeling 
agent may itself be one of the moieties comprising the antibody/analyte complex, 
Altematively, the labeling agent may be a third moiety, such as another antibody, that 
specifically binds to the antibody/protein complex. 

[121] In a preferred embodiment, the labeling agent is a second antibody 
bearing a label. Altematively, the second antibody may lack a label, but it may, in turn, be 
bound by a labeled third antibody specific to antibodies of the species from which the second 
antibody is derived. The second antibody can be modified with a detectable moiety, such as 
biotin, to which a third labeled molecule can specifically bind, such as enzyme-labeled 
streptavidin. 

[122] Oflier proteins capable of specifically binding immunoglobulin 
constant regions, such as protein A or protein G, can also be used as the label agents. These 
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^ ents ofthe cell walls of streptococca^Rteria. They exhibit a 

strong non-immunogenic reactivity with immunoglobulin constant regions from a variety of 
species (see, generally, Kronval, etal. J. Immunol, 111:1401-1406 (1973); and Akerstrom, et 
al. J. Immunol, 135:2589-2542 (1985)). 

[1 23] Throughout the assays, incubation and/or washing steps may be 
required after each combination of reagents. Incubation steps can vary from about 5 seconds 
to several hours, preferably from about 5 minutes to about 24 hours. The incubation time will 
depend upon the assay format, analyte, volume of solution, concentrations, and the like. 
Usually, the assays will be carried out at ambient temperature, although they can be 
conducted over a range of temperatures, such as 10°C to 40°C. 

1- Non-Competitive Assav Formats . 

[124] Immunoassays for detecting proteins or analytes of interest from tissue 
samples may be either competitive or noncompetitive. Noncompetitive immunoassays are 
assays in which the amoimt of captured protein or analyte is directly measured. In one 
preferred "sandwich" assay, for example, the capture agent {e.g., antibodies specific for the 
polypeptides of the invention) can be bound directly to a solid substrate where it is 
immobilized. These immobiUzed aatibodies then capture the polypeptide present in the test 
sample. The polypeptide of the invention thus immobilized is then bound by a labeling 
agent, such as a second labelled antibody specific for the polypeptide. Alternatively, the 
second antibody may lack a label, but it may, in turn, be bound by a labeled third antibody 
specific to antibodies of the species from which the second antibody is derived. The second 
can be modified with a detectable moiety, such as biotin, to which a third labeled molecule 
can specifically bind, such as enzyme-labeled streptavidin. 

2, Competitive Assav Formats 

[125] In competitive assays, the amoxmt of protein or analyte present in the 
sample is measured indirectly by measuring the amount of an added (exogenous) protein or 
analyte displaced (or competed away) from a specific capture agent (e.g., antibodies specific 
for a polypeptide of the invention) by the protein or analyte present in the sample. The 
amount of immunogen bound to the antibody is inversely proportional to the concentration of 
immunogen present in the sample. In a particularly preferred embodiment, the antibody is 
immobilized on a solid substrate. The amoxmt of analyte may be detected by providing a 
labeled analyte molecule. It is understood that labels can include, e.g., radioactive labels as 
well as peptide or other tags that can be recognized by detection reagents such as antibodies. 
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[126] TWSRimnjiRssivR in the comnetitive binding iviat can be used for cross- 



■ ■ m 

unoassays in the competitive binding VHat 
reactivity detenninations. For example, the protein encoded by the sequences described 
herein can be immobilized on a soUd support. Proteins are added to the assay and compete 
with the binding of the antisera to the immobilized antigen. The abiUty of the above proteins 
to compete with the binding of the antisera to the immobilized protein is compared to that of 
the protein encoded by any of the sequences described herein. The percent cross-reactivity 
for the above proteins is calculated, usmg standard calculations. Those antisera with less than 
10% cross-reactivity with each of the proteins hsted above are selected and pooled. The 
cross-reacting antibodies are optionally removed from the pooled antisera by 
immunoabsorption with the considered proteins, e.g.. distantly related homologs. 

[127] The immunoabsorbed and pooled antisera are then used in a 
competitive binding immunoassay as described above to compare a second protein, thought 
to be perhaps a protein of the present invention, to the immunogen protein. In order to make 
this comparison, the two proteins are each assayed at a wide range of concentrations and the 
amount of each protein required to inhibit 50% of the binding of the antisera to the 
immobilized protein is determined. If the amoimt of the second protein required is less than 
10 times the amount of the protein partially encoded by a sequence herein that is required, 
then the second protein is said to specifically bind to an antibody generated to an immunogen 
consisting of the target protein. 

3. Other Assay Formats 

[128] In some embodiments, western blot (immunoblot) analysis is used to 
detect and quantify the presence of a polypeptide of the invention in the sample. The 
technique generally comprises separating sample proteins by gel electrophoresis on the basis 
of molecular weight, transfening the separated protems to a suitable solid support (such as, 

a nitrocellulose filter, a nylon filter, or a derivatized nylon filter) and incubating the 
sample with the antibodies tiiat specifically bind the protein of interest. For example, 
antibodies are selected that specifically bind to the polypeptides of the invention on the solid 
support. These antibodies may be directly labeled or altematively may be subsequentiy 
detected using labeled antibodies (e.g., labeled sheep anti-mouse antibodies) that specifically 
bind to the antibodies against the protein of interest. 

[129] Other assay formats include liposome immunoassays (LIA), which use 
liposomes designed to bind specific molecules (e.g., antibodies) and release encapsulated 
reagents or markers. The released chemicals are then detected according to standard 
techniques (see, Monroe et al. (1986) Amer. Clin. Prod. Rev. 5:34-41). 
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4. Labels 

[130] The particular label or detectable group used in the assay is not a 
critical aspect of the invention, as long as it does not significantly interfere with the specific 
binding of the antibody used in the assay. The detectable group can be any material having a 
detectable physical or chemical property. Such detectable labels have been well-developed in 
the field of immunoassays and, in general, most labels usefiil in such methods can be applied 
to the present invention. Thus, a label is any composition detectable by spectroscopic, 
photochemical, biochemical, immunochemical, electrical, optical or chemical means. Useful 
labels in the present invention include magnetic beads {e.g., Dynabeads™), fluorescent dyes 
{e.g., fluorescein isothiocyanate, Texas red, rhodamine, and the Uke), radiolabels {e.g., ^H, 

^^S, ^"^C, or ^^P), enzymes {e.g., horse radish peroxidase, alkaline phosphatase and others 
commonly used in an ELISA), and colorimetric labels such as colloidal gold or colored glass 
or plastic {e.g., polystyrene, polypropylene, latex, etc.) beads. 

[131] The label may be coupled directly or indirectly to the desired 
component of the assay according to methods well known in the art. As indicated above, a 
wide variety of labels may be used, with the choice of label depending on the sensitivity 
required, the ease of conjugation with the compound, stability requirements, available 
instrumentation, and disposal provisions. 

[132] Non-radioactive labels are often attached by indirect means. The 
molecules can also be conjugated directly to signal generating compounds, e.g., by 
conjugation with an enzyme or fluorescent compound. A variety of enzymes and fluorescent 
compounds can be used with the methods of the present invention and are well-known to 
those of skill in the art (for a review of various labeling or signal producing systems which 
maybe used, see, e.g., U.S. Patent No. 4,391,904). 

[133] Means of detecting labels are well known to those of skill in the art. 
Thus, for example, where the label is a radioactive label, means for detection include a 
scintillation counter or photographic fihn as in autoradiography. Where the label is a 
fluorescent label, it may be detected by exciting the fluorochrome with the appropriate 
wavelength of Ught and detecting the resulting fluorescence. The fluorescence may be 
detected visually, by means of photographic film, by the use of electronic detectors such as 
charge coupled devices (CCDs) or photomultiphers and the like. Similarly, enzymatic labels 
may be detected by providing the appropriate substrates for the enzyme and detecting the 
resulting reaction product. Finally simple colorimetric labels may be detected directly by 
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observing the color associSed with the label. Thus, in various d|f^ck assays, conjugated 
gold often appears pink, while various conjugated beads appear the color of the bead. 

[134] Some assay formats do not require the use of labeled components. For 
instance, agglutination assays can be used to detect the presence of the target antibodies; Jn 
this case, antigen-coated particles are agglutinated by samples comprising the target 
antibodies. In this format, none of the components need to be labeled and the presence of the 
target antibody is detected by simple visual inspection. 

VI. roENXmCATION OF MODULATORS OF POLYPEPTIDES OF THE 
INVENTION 

(135) Modulators of a polypeptide of the invention, i.e. agonists or 
antagonists of a polypeptide's activity, or polypeptide's or polynucleotide's expression, are 
useful for treating a number of human diseases, including diabetes. For example, 
administration of modulators can be used to treat diabetic patients or prediabetic individuals 
to prevent progression, and therefore symptoms, associated with diabetes (including insulin 
resistance). 



A. Agents that Modulate Polypeptides of the Invention 

[136] The agents tested as modulators of polypeptides of the invention can 

be any small chemical compound, or a biological entity, such as a protein, sugar, nucleic acid 

or lipid. Typically, test compounds will be small chemical molecules and peptides. 

Essentially any chemical compound can be used as a potential modulator or Ugand in the 

assays of the invention, although most often compounds that can be dissolved in aqueous or 

organic (especially DMSO-based) solutions are used. The assays are designed to screen large 

chemical Ubraries by automating the assay steps and providing compounds from any 

convenient source to assays, which are typically run in parallel (eg., in microtiter formats on 

microtiter plates in robotic assays). Modulators also include agents designed to reduce the 

level of mKNA encoding a polypeptide of the invention (e.g. antisense molecules, ribozymes, 

DNAzymes, small inhibitory RNAs and the like) or the level of translation from an mRNA 

(e.g., translation blockers such as an antisense molecules that are complementary to 

translation start or other sequences on an mRNA molecule). It will be appreciated that there 

are many suppUers of chemical compounds, including Sigma (St. Louis, MO), Aldrich (St. 

Louis, MO), Sigma- Aldrich (St. Louis, MO), Fluka Chemika-Biochemica Analytika (Buchs, 

Switzerland) and the like. 
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[137] ]^ome embodiments, high throughput scl 





Lg methods involve 



providing a combinatorial chemical or peptide library containing a large number of potential 
therapeutic compoxmds (potential modulator compounds). Such "combinatorial chemical 
libraries" or "ligand libraries" are then screened in one or more assays, as described herein, to 
5 identify those Ubrary members (particular chemical species or subclasses) that display a 
desired characteristic activity. The compounds thus identified can serve as conventional 
"lead compounds" or can themselves be used as potential or actual therapeutics. 



compounds generated by either chemical synthesis or biological synthesis, by combining a 

10 niraiber of chemical "building blocks" such as reagents. For example, a linear combinatorial 
chemical library such as a pol3^eptide library is formed by combining a set of chemical 
building blocks (amino acids) in every possible way for a given compound length (i.e., the 
number of amino acids in a polypeptide compound). Millions of chemical compounds can be 
synthesized through such combinatorial mixing of chemical building blocks. 

15 [1391 Preparation and screening of combinatorial chemical libraries is well 

known to those of skill in the art. Such combinatorial chemical libraries include, but are not 
limited to, peptide libraries {see, e.g., U.S. Patent 5,010,175, Furka, Int. J. Pept. Prot Res. 
37:487-493 (1991) and Houghton et al. Nature 354:84-88 (1991)). Other chemistries for 
generating chemical diversity libraries can also be used. Such chemistries include, but are 

20 not Umited to: peptoids (e.g., PCT Publication No. WO 91/19735), encoded peptides {e.g., 
PCX Publication WO 93/20242), random bio-oligomers (e.^., PCT Publication No. WO 
92/00091), benzodiazepines (e.g.^ U.S. Pat No. 5,288,514), diversomers such as hydantoins, 
benzodiazepines and dipeptides (Hobbs etal, Proc. Nat Acad. Set USA 90:6909-6913 
(1993)), vinylogous polypeptides (Hagihara et al, J. Amer. Chem. Soc. 114:6568 (1992)), 

25 nonpeptidal peptidomimetics with glucose scaffolding (Hirschmann et al, J. Amer. Chem. 

Soc. 114:9217-9218 (1992)), analogous organic syntheses of small compound Ubraries (Chen 
et aL, J. Amer. Chem. Soc. 1 16:2661 (1994)), oligocarbamates (Cho et al. Science 261:1303 
(1993)), and/or peptidyl phosphonates (Campbell et al, J. Org. Chem. 59:658 (1994)), 
nucleic acid libraries {see Axxsubel, Berger and Sambrook, all supra)^ peptide nucleic acid 

30 libraries {see, e.g., U.S. Patent 5,539,083), antibody libraries {see, e.g., Vaughn et al. Nature 
Biotechnology, 14(3):309-314 (1996) and PCT/US96/1 0287), carbohydrate libraries {see, 
e.g., Liang et al. Science, 274:1520-1522 (1996) and U.S. Patent 5,593;853), small organic 
molecule libraries {see, e.g., benzodiazepines, Baum C&EN, Jan 18, page 33 (1993); 
isoprenoids, U.S. Patent 5,569,588; thiazoUdinones and metathiazanones, U.S. Patent 



[138] A combinatorial chemical library is a collection of diverse chemical 
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5,549,974; pyrroUdines^^. Patents 5,525,735 and 5,519,134; jUiolino compounds, U.S. 
Patent 5,506,337; benzodiazepines, 5,288,514, and the like). 

[140] Devices for the preparation of combinatorial libraries are commercially 
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Wobum, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA), lii addition, numerous combinatorial libraries are themselves commercially 
available (see, e.g.. ComGenex, Princeton, N.J., Tripos, Inc., St. Louis, MO, 3D 
Pharmaceuticals, Exton, PA, Martek Biosciences, Columbia, MD, etc.). 



10 B. Methods of Screening for Modulators of the Polypeptides of the Invention 

[141] A number of different screening protocols can be utilized to identify 
agents that modulate the level of expression or activity of a polynucleotide of a polypeptide 
of the invention in cells, particularly mammalian cells, and especially human cells. In 
general terms, the screening methods involve screening a plurality of agents to identify an 
1 5 agent that modulates the activity of a polypeptide of the invention by, e.g., binding to the 
polypeptide, preventing an mhibitor or activator from binding to the polypeptide, increasing 
association of an inhibitor or activator with the polypeptide, or activating or inhibiting 
expression of ttie polypeptide. 

[142] Any cell expressing a fiill-length polypeptide of the invention or a 
20 fragment thereof can be used to identify modulators. In some embodiments, the cells are 
eukaryotic cells lines (e.g., CHO or HEK293) transformed to express a heterologous 
polypeptide of the invention. In some embodiments, a cell expressing an endogenous 
polypeptide of the invention is used in screens, hi ottier embodiments, modulators are 
screened for their ability to effect insulin responses. 

25 

1. Polvpeptide Binding Assays 

[1 43] Preliminary screens can be conducted by screening for agents capable 
of binding to polypeptides of the invention, as at least some of the agents so identified are 
likely modulators of a polypeptide of the invention. Binding assays are also useful, e.g., for 
30 identifying endogenous proteins that interact with polyp^tides of the invention. For 

example, antibodies, receptors or other molecules that bind polypeptides of the invention can 
be identified m binding assays. 
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am 
g assays usually involve contacting l^lypqptide of the 

invention with one or more test agents and allowing sufficient time for the protein and test 

agents to form a binding complex. Any binding complexes formed can be detected using any 

of a number of estabUshed analytical techniques. Protein binding assays mclude, but are not 

Umited to, methods that measure co-precipitation or co-migration on non-denaturing SDS- 

polyacrylamide gels, and co-migration on Western blots {see, e.g., Bemiet, J.P. and 

Yamamura, H.L (1985) ♦'Neurotransmitter, Hormone or Drug Receptor Binding Methods," in 

Neurotransmitter Receptor Binding (Yamamura, H. I., et al, eds.), pp. 61-89. Other binding 

assays involve the use of mass spectrometry or NMR techniques to identify molecules bound 

to a polypeptide of the invention or displacement of labeled substrates. The polypeptides of 

the invention utilized in such assays can be naturally expressed, cloned or synthesized. 

[1451 In addition, mammalian or yeast two-hybrid approaches {see, e.g., 

Bartel, P.L. et. al. Methods Enzymol, 254:241 (1995)) can be used to identify polypeptides or 

other molecvdes that interact or bind when expressed together in a host cell. 



2. Polypeptide Activity 

[146] The activity of polypeptides of the invention can be assessed using a 
variety of in vitro and in vivo assays to determine functional, chemical, and physical effects, 
e.g., measuring ligand binding {e.g., radioactive or otherwise labeled ligand binding), second 
messengers (e.g., cAMP, cGMP, IP3, DAG, or Ca^"^, ion flux, phosphorylation levels, 
transCTiption levels, and the like. Furthermore, such assays can be used to test for inhibitors 
and activators of the polypeptides of the invention. Modulators can also be genetically 
altered versions of polypeptides of the invention. 

[147] The polypeptide of the assay will be selected from a polypeptide with 
substantial identify to a sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID 
NO:8, SEQ ID NO:10, SEQ ID NO:12 or SEQ ID NO:14 or other conservatively modified 
variants thereof Generally, the amino acid sequence identify will be at least 70%, optionally 
at least 85%, optionally at least 90-95% to the polypeptides exemplified herein. Optionally, 
the polypeptide of the assays will comprise a fragment of a polypeptide of the invention, such 
as an extracellular domain, transmembrane domain, cytoplasmic domain, ligand binding 
domain, subimit association domain, active site, and the like. Either a polypeptide of the 
invention or a domain thereof can be covalently linked to a heterologous protein to create a 
chimeric protein used in the assays described herein. 
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[148] Ntodulators of polypeptide activity are testlRsing either recombinant 
or naturally occurring polypeptides of the invention. The protein can be isolated, expressed 
in a cell, expressed in a membrane derived from a cell, expressed in tissue or in an animal, 
either recombinant or naturally occurring. For example, tissue shoes, dissociated cells, e.g., 
from tissues expressing polypeptides of the invention, transformed cells, or membranes can 
be used. Modulation is tested using one of the in vitro or in vivo assays described herein. 

[149] Modulator binding to polypeptides of the invention, a domain, or 
chimeric protein can be tested in solution, in a bilayer membrane, attached to a solid phase, in 
a lipid monolayer, or in vesicles. Binding of a modulator can be tested using, e.g., changes in 
spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index), hydrodynamic 
(e.g., shape), chromatographic, or solubiUty properties. 

[150] Samples or assays that are treated with a potential moulator (e.g., a 
"test compound") are compared to control san^les without the test compound, to examine tiie 
extent of modulation. Control samples (untreated with activators or inhibitors) are assigned a 
relative activity value of 100. Inhibition of the polypeptides of the invention is achieved 
when the activity value relative to the control is about 90%, optionally 50%, optionally 25- 
0%. Activation of the polypeptides of the invention is achieved when the activity value 
relative to the control is 110%, optionally 150%, 200%, 300%, 400%, 500%, or 1000-2000%. 



polynucleotide or a polyp^tide of the invention is also provided. Screening methods 
generally involve conducting cell-based assays in which test compounds are contacted with 
one or more cells expressing a polynucleotide or a polypeptide of the invention, and then 
detecting an increase or decrease in expression (either transcript or translation product). 
Assays can be performed with any cells that express a polynucleotide or a polypeptide of the 
invention. 

[152] Expression can be detected in a number of diffarent ways. As 
described infra, the expression level of a polynucleotide of the invention in a cell can be 
determined by probing the mRNA expressed in a cell with a probe that specifically hybridizes 
with a transcript (or complementary nucleic acid derived therefrom) of a polynucleotide of 
the invention. Probing can be conducted by lysing the cells and conducting Northern blots or 
without lysing the cells using in ji<M-hybridization techniques. Alternatively, a polypeptide 



3. Expression Assavs 

[151] Screening for a compound that modulates the expression of a 
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;ted using immunological methods li^Kich a cell lysate is 

probed with antibodies that specifically bind to the polypeptide. 

[153] The level of e3q)ression or activity of a polynucleotide or a polypeptide 
of the invention can be compared to a baseline value. The baseline value can be a value for a 
control sample or a statistical value that is representative of expression levels of a 
polynucleotide or a polypeptide of the invention for a control population (e.g., lean 
individuals as described herein) or cells (e.g., tissue culture cells not exposed to a modulator). 
Expression levels can also be determined for cells that do not express the polynucleotide or a 
polypeptide of the invention as a negative control. Such cells generally are otherwise 
substantially genetically the same as the test cells. 

[154] A variety of different types of cells can be utilized in the r^orter 
assays. Cells that do not endogenously express a polypeptide of the invention can be 
prokaryotic, but are preferably eukaryotic. The eukaryotic cells can be any of the cells 
typically utilized in generating cells that harbor recombinant nucleic acid constructs. 
Exemplary eukaryotic cells include, but are not limited to, yeast, and various higher 
eukaryotic cells such as the HEK293, HepG2, COS, CHO and HeLa cell lines. 

[155] Various controls can be conducted to ensure that an observed activity 
is authentic including running parallel reactions with cells that lack the reporter construct or 
by not contacting a cell harboring the reporter construct with test compound. Compounds 
can also be further validated as described below. 

4. Validation 

[156] Agents that are initially identified by any of the foregoing screening 
methods can be fiirther tested to validate the apparent activity. Modulators that are selected 
for fiirther study can be tested on the "classic" insulin responsive cell Ime, mouse 3T3-L1 
adipocytes, muscle ceUs such as L6 cells and the like. Cells (e.g., adipocytes or muscle cells) 
are pre-incubated with the modulators and tested for acute (up to 4 hours) and chronic 
(overnight) effects on basal and insulin-stimulated GLUT4 translocation and glucose uptake. 

[1571 Following such studies, vaUdity of the modulators is tested in suitable 
animal models. The basic format of such methods involves administering a lead compound 
identified during an initial screen to an animal that serves as a model for himians and then 
determining if expressionof activity of a polypeptide of the invention is in fact modiilated. 

[158] The effect of the compound will be assessed in either diabetic animals 
or in diet induced insulin resistant animals. The blood glucose and insulin levels will be 
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determined. The animalmodels utilized in validation studies gdMly are mammals of any 
kind. Specific examples of suitable animals include, but are not limited to, primates, mice 
and rats. For example, monogenic models of diabetes (e.g., ob/ob and db/db mice, Zucker 
rats and Zucker Diabetic Fatty rats etc) or polygenic models of diabetes (e.g., OLETF rats, 
GK rats, NSY mice, and KK mice) can be useM for validating modulation of a polypeptide 
of the invention in a diabetic or insulin resistant animal, hi addition, transgenic animals 
e3q)ressing human polypeptides of the invention can be used to further validate drug 
candidates. 



C. Solid Phase and Soluble High Throughput Assays 

[159] In the high throughput assays of the invention, it is possible to screen 
up to several thousand different modulators or ligands in a single day. M particular, each 
well of a microtiter plate can be used to run a separate assay against a selected potential 
modulator, or, if concentration or incubation time effects are to be observed, every 5-10 wells 
can test a single modulator. Thus, a smgle standard microtiter plate can assay about 100 
ie.g., 96) modulators. If 1536 well plates are used, then a single plate can easily assay from 
about 100 to about 1500 different compounds. It is possible to assay several different plates 
per day; assay screens for up to about 6,000-20,000 or more different compounds are possible 
usmg the integrated systems of the invraition. In addition, microfluidic approaches to reagent 
manipulation can be used. 

[1601 A molecule of interest (e.g., a polypeptide or polynucleotide of the 
invention, or a modulator thereof) can be bound to Ihe soUd-state component, directly or 
indirectly, via covalent or non-covalent linkage, e.g. , via a tag. The tag can be any of a 
variety of components. In general, a molecvde that binds the tag (a tag binder) is fixed to a 
soUd support, and the tagged molecule of interest is attached to the sohd support by 
interaction of the tag and the tag binder. 

[161] A number of tags and tag binders can be used, based upon known 
molecular interactions well described in the hterature. For example, where a tag has a natural 
binder, for example, biotm, protein A, or protein G, it can be used in conjunction with 
appropriate tag binders (avidin, streptavidin, neutravidin, the Fc region of an 
unmunoglobuhn, poly-His, etc.) Antibodies to molecules with natural bmders such as biotin 
are also widely available and appropriate tag binders (see. SIGMA Immunochemicals 1998 
catalogue SIGMA, St. Louis MO). 



wo 03/102161 



PCT/US03/17725 




42 




[162] 



ilarly, any haptenic or antigenic com] 



id can be used in 



combination with an appropriate antibody to form a tag/tag binder pair. Thousands of 
specific antibodies are commercially available and many additional antibodies are described 
in the literature. For example, in one common configuration, the tag is a first antibody and 
5 the tag binder is a second antibody that recognizes the first antibody. In addition to antibody- 
antigen interactions, receptor-Ugand interactions are also appropriate as tag and tag-binder 
pairs, such as agonists and antagonists of cell membrane receptors {e.g., cell receptor-Ugand 
interactions such as transferrin, c-kit, viral receptor ligands, cytokine receptors, chemokine 
recqjtors, interleukin receptors, immunoglobulin receptors and antibodies, the cadherin 

10 family, the integrin family, the selectin family, and the like; see, e.g., Pigott & Power, The 
Adhesion Molecule Facts Book I (1993)). Similarly, toxins and venoms, viral epitopes, 
hormones (e,g., opiates, steroids, etc), intracellular receptors (e.g., which mediate the effects 
of various small ligands, including steroids, thyroid hormone, retinoids and vitamin D; 
peptides), drags, lectins, sugars, nucleic acids (both linear and cyclic polymer 

15 configurations), oligosaccharides, proteins, phospholipids and antibodies can all interact with 
various cell receptors. 



polyureas, polyamides, polyethyleneimines, polyarylene sulfides, polysiloxanes, polyimides, 
and polyacetates can also form an appropriate tag or tag binder. Many other tag/tag binder 
20 pairs are also usefiil in assay systems described herein, as would be apparent to one of skill 
upon review of this disclosure. 



serve as tags, and include polypeptide sequences, such as poly-gly sequences of between 
about 5 and 200 amino acids. Such flexible linkers are known to those of skill in the art. For 
25 example, poly(ethelyne glycol) linkers are available from Shearwater Polymers, Inc., 

Huntsville, Alabama. These linkers optionally have amide linkages, sulfliydryl linkages, or 
heterofimctional linkages. 



methods currently available. Solid substrates are commonly derivatized or fimctionalized by 
30 exposing all or a portion of the substrate to a chemical reagent that fixes a chemical group to 
the surface that is reactive with a portion of the tag binder. For example, groups that are 
suitable for attachment to a longer chain portion would include amines, hydroxyl, thiol, and 
carboxyl groups. Aminoalkylsilanes and hydroxyalkylsilanes can be used to fimctionalize a 
variety of surfaces, such as glass surfaces. The construction of such solid phase biopolymer 



[163] Synthetic polymers, such as polyurethanes, polyesters, polycarbonates. 



[164] Common linkers such as peptides, polyethers, and the like can also 



[165] Tag binders are fixed to solid substrates using any of a variety of 
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arrays is well describe^n the literature (see^ e.g., Merrifield, J^^^ Chem. Soa 85:2149- 
2154 (1963) (describing solid phase synthesis of, e.g., peptides); Geysen et al, J. Immun, 
Meth, 102:259-274 (1987) (describing synthesis of solid phase components on pins); Frank 
and Doring, Tetrahedron 44:60316040 (1988) (describing synthesis of various peptide 
5 sequences on cellulose disks); Fodor et aU Science, 251:161-111 (1991); Sheldon et al. 
Clinical Chemistry 39(4):718"719 (1993); and Kozal et al. Nature Medicine 2(7):753759 
(1996) (all describing arrays of biopolymers fixed to soUd substrates). Non-chemical 
approaches for fixing tag binders to substrates include other common methods, such as heat, 
cross-linking by UV radiation, and the like. 
10 [1 66] The invention provides in vitro assays for identifying, in a high 

throughput format, compounds that can modulate the expression or activity of a polypeptide 
of the invention. Control reactions that measure activity of a polypeptide of the invention in 
a cell in a reaction that does not include a potential modulator are optional, as the assays are 
highly uniform. Such optional control reactions are appropriate and increase the reliability of 
1 5 the assay. Accordingly, in some embodiments, the methods of the invention include such a 
control reaction. For each of the assay formats described, "no modulator^' control reactions 
that do not include a modidator provide a backgroimd level of binding activity. 



two types of positive controls are appropriate. First, a known activator of a polypeptide or a 
20 polynucleotide of the invention can be incubated with one sample of the assay, and the ^ 
resulting increase in si^al resulting firom an increased expression level or activity of a 
polypeptide or a polynucleotide of the invention are determined according to the methods 
herein. Second, a known mhibitor of a polypeptide or a polynucleotide of the invention can 
be added, and tiie resulting decrease in signal for the expression or activity of a polypeptide 
25 or a polynucleotide of the invention can be similarly detected. It will be appreciated that 

modulators can also be combined with activators or inhibitors to find modulators that inhibit 
the increase or decrease that is otherwise caused by the presence of the known modulator of a 
polypeptide or a polynucleotide of the invention. 



practicing the assays described herein using nucleic acids or polypeptides of the invention, 
antibodies, etc. 



[1671 hi some assays it will be desirable to have positive controls. At least 



Vn. COMPOSITIONS, KITS AND INTEGRATED SYSTEMS 



30 



[168] The invention provides compositions, kits and integrated systems for 
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[169] Tneinvention provides assay compositiofl^ror use in solid phase 
assays; such compositions can include, for example, one or more nucleic acids encoding a 
polypeptide of the invention immobilized on a solid support, and a labeling reagent. In each 



hybridization. Modulators of expression or activity of a polypeptide of the invention can also 
be included in the assay compositions. 



invention. The kits typically include a probe that comprises an antibody that specifically 
binds to a polypeptide of the invention or a polynucleotide sequence encoding such 
polypeptides, and a label for detecting the presence of the probe. The kits may include at 
least one polynucleotide sequence encoding a polypeptide of the invention. Kits can include 
any of the compositions noted above, and optionally further include additional componrats 
such as instructions to practice a high-throughput method of assaying for an effect on 
expression of the genes encoding a polypeptide of the invention, or on activity of a 
polypeptide of the invention, one or more containers or compartments (e.g., to hold the probe, 
labels, or the like), a control modulator of the expression or activity of a polypeptide of the 
invention, a robotic armature for mixing kit components or the like. 

[171] The invention also provides integrated systems for high-throughput 
screening of potential modulators for an effect on the expression or activity of a polypeptide 
of the invention. The systems can mclude a robotic armature v^hich transfers fluid from a 
source to a destination, a controller which controls the robotic armature, a label detector, a 
data storage U3tut which records label detection, and an assay component such as a noicrotiter 
dish comprising a well having a reaction mixture or a substrate comprising a fixed nucleic 
acid or immobilization moiety. 

[1 721 A number of robotic fluid transfer systems are available, or can easily 
be made fi:om existing components. For example, a Zymate XP (Zymark Corporation; 
Hopkinton, MA) automated robot using a Microlab 2200 (Hamilton; Reno, NV) pipetting 
station can be used to transfer parallel samples to 96 well microtiter plates to set up several 
parallel simultaneous binding assays. 

[1731 Optical images viewed (and, optionally, recorded) by a camera or other 
recording device {e,g„ a photodiode and data storage device) are optionally fiuther processed 
in any of the embodiments herein, e.g., by digitizing the image and storing and analyzing the 
image on a computer. A variety of commercially available peripheral equipment and 



case, the assay compositions can also include additional reagents that are desirable for 



[170] The invention also provides kits for carrying out the assays of the 
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software is available fo^gitizing, storing and analyzing a digitIK video or digitized optical 
image, 

[174] One conventional system carries light from the specimen field to a 
cooled charge-coupled device (CCD) camera, in common use in ttie art. A CCD camera 
includes an array of picture elements (pixels). The Ught from the specimen is imaged on the 
CCD. Particular pixels corresponding to regions of the specimen {e.g., individual 
hybridization sites on an array of biological polymers) are sampled to obtain light intensity 
readings for each position. Multiple pixels are processed in parallel to increase speed. The 
apparatus and methods of the invention are easily used for viewing any sample, e.g.y by 
fluorescent or dark field microscopic techniques. 

Vm. ADMINISTRATION AND PHARMACEUTICAL COMPOSITIONS 

[175] Modulators of the polypeptides of the invention (e.g., antagonists or 
agonists) can be administered directly to the mammaUan subject for modulation of activity of 
a polypeptide of the invention in vivo. Administration is by any of the routes normally used 
for introducing a modulator compound into ultunate contact with the tissue to be treated and 
is well known to those of skill in the art. Although more tiian one route can be used to 
administer a particular composition, a particular route can often provide a more immediate 
and more effective reaction than another route. 

[176] The pharmaceutical compositions of the invention may comprise a 
pharmaceutically acceptable carrier. Pharmaceutically acceptable carriers are determined in 
part by the particular composition being administered, as well as by the particular method 
used to administer the composition. Accordingly, there are a wide variety of suitable 
formulations of pharmaceutical compositions of the present invention (see, e.g.. Remington's 
Pharmaceutical Sciences, 17*^ ed. 1985)). 

[177] The modulators (e.g., agonists or antagonists) of the expression or 
activity of the a polypeptide of the invention, alone or in combination with other suitable 
components, can be prepared for injection or for use in a pump device. Pump devices (also 
known as "insulin pumps'*) are commonly used to administer insulin to patients and therefore 
can be easily adapted to include compositions of the present invention. Manufacturers of 
insulin pumps include Animas, Disetronic and MiniMed. 

[178] The modulators (e.g., agonists or antagonists) of the expression or 
activity of a polypeptide of the invention, alone or m combination wifli other suitable 
components, can be made into aerosol formulations (z.e., they can be •^nebulized'*) to be 
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administered via inhalation. Aerosol formulations can be placed^R pressurized acceptable 
propellents, such as dichlorodifluoromethane, propane, nitrogen, and the like. 

[179] Formulations suitable for administration include aqueous and non- 
aqueoxis solutions, isotonic sterile solutions, which can contain antioxidants, buffers, 
bacteriostats, and solutes that render the foraiulation isotonic, and aqueous and non-aqueous 
sterile suspensions that can include suspending agents, solubilizers, thickening agents, 
stabilizers, and preservatives. In the practice of this invention, compositions can be 
administered, for example, orally, nasally, topically, intravenously, intraperitoneally, or 
intrathecally. The formulations of compounds can be presented in unit-dose or multi-dose 
sealed containers, such as ampoules and vials. Solutions and suspensions can be prepared 
from sterile powders, granules, and tablets of the kind previously described. The modxilators 
can also be administered as part of a prepared food or dmg. 

[1801 The dose administered to a patient, in the context of the present 
invention should be sufficient to induce a beneficial response in the subject over time. The 
optimal dose level for any patient will depend on a variety of factors including the efficacy of 
the specific modulator employed, the age, body weight, physical activity, and diet of the 
patient, on a possible combination with other drugs, and on the severity of the case of 
diabetes. It is recommended that the daily dosage of the modulator be determined for each 
individual patient by those skilled in the art in a similar way as for known insulin 
compositions. The size of tihe dose also will be determined by the existence, nature, and 
Qxt&at of any adverse side-effects that accompany the administration of a particular 
compound or vector in a particidar subject. 

[181] In determining the effective amount of the modxilator to be 
administered a physician may evaluate circulating plasma levels of the modulator, modulator 
toxicity, and the production of anti-modulator antibodies. In general, the dose eqxiivalent of a 
modulator is from about 1 ng/kg to 10 mg/kg for a typical subject. 

[1 82] For administration, modulators of the present invention can be 
administered at a rate determined by the LD-50 of the modulator, and the side-effects of the 
modulator at various concentrations, as applied to the mass and overall health of the subject. 
Administration can be accomplished via single or divided doses. 

[1 83] The compounds of the present invention can also be used effectively in 
combination with one or more additional active agents depending on the desired target 
therapy (see, e.g.. Turner, N. et al. Prog. Drug Res. (1998) 51: 33-94; Haffiier, S. Diabetes 
Care (1998) 21 : 160-178; and DeFronzo, R. et al, (eds.). Diabetes Reviews (1997) Vol. 5 No, 
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ve investigated the benefits of combiHRon there^ies with oral 

agents (see, e.g., Mahler, R., J. Clin. Endocrinol Metab. (1999) 84: 1165-71; United 

Kingdom Prospective Diabetes Study Group: UKPDS 28, Diabetes Care (1998) 21: 87-92; 

Bardin, C. W.,(ed.), Current Tlierapyln Endocrinology And Metabolism, 6th Edition (Mosby 

5 - Year Book, Inc., St. Louis, MO 1997); Chiasson, J. et al., Ann. Intern. Med. (1994) 121: 

928-935; Coniff, R. et al., Clin. Titer. (1997) 19: 16-26; Coniff, R. et al.. Am. J. Med (1995) 

98: 443-451; and Iwamoto, Y. et al., Diabet. Med. (1996) 13 365-370; Kwiterovich, P. Am. J. 

Cardiol (1998) 82(12A): 3U-17U). These studies indicate that modulation of diabetes, 

among other diseases, can be fiuther improved by the addition of a second agent to the 

1 0 therapeutic regimen. Combination therapy includes administration of a single pharmaceutical 
dosage formulation that contains a modulator of the invention and one or more additional 
active agents, as well as administration of a modulator and each active agent in its own 
separate pharmaceutical dosage formulation. For example, a modulator and a 
thiazolidinedione can be administered to the human subject together in a single oral dosage 

15 composition, such as a tablet or capsule, or each agent can be administered in separate oral 
dosage formulations. Where separate dosage formulations are used, a modulator and one or 
more additional active agents can be administered at essentially the same time (i.e., 
concurrently), or at separately staggered times (i.e., sequentially). Combination therapy is 
understood to include all these regimens. 

20 [184] One example of combination therapy can be seen in treating pre- 

diabetic individuals (e.g., to prevent progression into type 2 diabetes) or diabetic individuals 
(or treating diabetes and its related symptoms, complications, and disorders), wherein the 
modulators can be effectively used in combination with, for example, sulfonylureas (such as 
chlorpropamide, tolbutamide, acetohexamide, tolazamide, glyburide, gliclazide, glynase, 

25 glimepiride, and gUpizide); biguanides (such as metformin); a PPAR beta delta agonist; a 
ligand or agonist of PPAR gamma such as thiazoUdinediones (such as ciglitazone, 
pioglitazone (see, e.g., U.S. Patent No. 6,218,409), trogUtazone, and rosiglitazone (see, e.g., 
U.S. Patent No. 5,859,037)); PPAR alpha agonists such as clofibrate, gemfibrozil, 
fenofibrate, ciprofibrate, and bezafibrate; dehydroepiandrosterone (also referred to as DHEA 

30 or its conjugated sulphate ester, DHEA-S04); antiglucocorticoids; TNFa inhibitors; a- 
glucosidase inhibitors (such as acarbose, miglitol, and voglibose); amylin and amylin 
derivatives (such as pramUntide, (see, also, U.S. Patent Nos. 5,902,726; 5,124,314; 5,175,145 
and 6,143,718.)); insulin secretogogues (such as repaghnide, gliquidone, and nateglinide (see, 
also, U.S. Patent Nos. 6,251,856; 6,251,865; 6,221,633; 6,174,856)), and insulin. 
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DC GENE TEffi 

[1 85] Conventional viral and non-viral based gene transfer methods can be 
used to introduce nucleic acids encoding engineered polypeptides of the invention in 
mammalian cells or target tissues. Such methods can be used to administer nucleic acids 
5 encoding polypeptides of the invention to cells in vitro. In some embodiments, the nucleic 
acids encoding polypeptides of the invention are administered for in vivo or ex vivo gene 
therapy \xses. Non-viral vector delivery systems include DNA plasmids, naked nucleic acid, 
and nucleic acid complexed with a delivery vehicle such as a liposome. Viral vector delivery 
systems include DNA and KNA viruses, which have either episomal or integrated genomes 
1 0 after delivery to the cell. For a review of gene therapy procedures, see Anderson, Science 
256:808-813 (1992); Nabel & Feigner, TiBZBCff 11:211-217 (1993); Mitani & Caskey, 
IWTECH 11:162-166 (1993); Dillon, TIBTECH 11:167 A75 (1993); MUler, Nature 357:455- 
460 (1992); Van Brunt, Biotechnology 6(10): 1149-1 154 (1988); Vigne, Restorative 
Neurology and Neuroscience 8:35-36 (1995); Kremer & Perricaudet, British Medical Bulletin 
15 51(l):31-44 (1995); Haddada et al, in Current Topics in Microbiology and Immunology 
Doerfler and Bohm (eds) (1995); and Yu et aL, Gene Therapy 1:13-26 (1994). 

[186] Methods of non-viral delivery of nucleic acids encoding engineered 
polypeptides of the invention include lipofection, microinjection, biolistics, virosomes, 
liposomes, immimoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, 
20 artificial virions, and agent-enhanced uptake of DNA. Lipofection is described in e.g., US 
5,049,386, US 4,946,787; and US 4,897,355) and Upofection reagents are sold commercially 
(e.g., Transfectam™ and Lipofectin'^^. Cationic and neutral lipids that are suitable for 
efficient receptor-recognition lipofection of polynucleotides include those of Feigner, WO 
91/17424, WO 91/16024. Delivery can be to cells {ex vivo aduMnistration) or target tissues 
25 {in vivo administration). 

[1 871 The preparation of lipid:nucleic acid complexes, including targeted 
liposomes such as immimolipid complexes, is weU known to one of skill in the art {see, e.g.. 
Crystal, Science 270:404-410 (1995); Blaese et al. Cancer Gene TJier. 2:291-297 (1995); 
Behr et al., Bioconjugate Chem. 5:382-389 (1994); Remy et al, Bioconjugate Chem. 5:647- 
30 654 (1994); Gao et al, Gene Therapy 2:710-722 (1995); Ahmad et al. Cancer Res. 52:4817- 
4820 (1992); U.S. Pat Nos. 4,186,183, 4,217,344, 4,235,871, 4,261,975, 4,485,054, 
4,501,728, 4,774,085, 4,837,028, and 4,946,787). 

[188] The use of RNA or DNA viral based systems for the delivery of 
nucleic acids encoding engineered polypeptides of the invention take advantage of highly 
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evolved processes for targeting a virus to specific cells in the bofj^d trafficking the viral 
payload to the nucleus. Viral vectors can be administered directly to patients (in vivo) or they 
can be used to treat cells in vitro and the modified cells are administered to patients (ex vivo). 
Conventional viral based systems for the delivery of polypeptides of the invention could 
include retroviral, lentivmis, adenoviral, adeno-associated and herpes simplex virus vectors 
for gene transfer. Viral vectors are currently the most efficient and versatile method of gene 
transfer in target cells and tissues. Integration in the host genome is possible with the 
retrovirus, lentivirus, and adeno-associated virus gene transfer methods, often resulting in 
long terai expression of the inserted transgene. Additionally, high transduction efficiencies 
have been observed in many different cell types and target tissues. 

[189] The tropism of a retrovirus can be altered by incorporating foreign 
envelope proteins, expanding the potential target population of target cells. Lentiviral vectors 
are retroviral vectors that are able to transduce or infect non-dividing cells and typically 
produce high viral titers. Selection of a retroviral gene transfer system would therefore 
depend on the target tissue. Retroviral vectors are comprised of cz.y-acting long terminal 
repeats with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis- 
acting LTRs are sufficient for replication and packaging of the vectors, which are then used 
to integrate the therapeutic gene into the target cell to provide permanent transgene 
expression. Widely used retroviral vectors include those based upon murine leukemia virus 
(MuLV), gibbon ape leukemia viras (GaLV), Simian Immuno deficiency virus (SIV), human 
immxmo deficiency vims (inV), and combinations thereof (seCy e.g„ Buchscher et al^J. 
Virol. 66:2731-2739 (1992); Johannes a/., J. Virol 66:1635-1640 (1992); Sommerfelt a/., 
Virol 176:58-59 (1990); Wilson a/., J. Virol 63:2374-2378 (1989); Miller a/., J. Virol 
65:2220-2224 (1991); PCTAJS94/05700). 

[190] In applications where transient expression of the polypeptides of the 
invention is preferred, adenoviral based systems are typically used. Adenoviral based vectors 
are capable of very high transduction efficiency in many cell types and do not require cell 
division. With such vectors, high titer and levels of expression have been obtained. This 
vector can be produced in large quantities in a relatively simple system. Adeno-associated 
virus ("AAV**) vectors are also used to transduce cells with target nucleic acids, e.g., in the in 
vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy 
procedures (see, e,g.. West etal. Virology 160:38-47 (1987); U.S. Patent No. 4,797,368; WO 
93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); Muzyczka, /. Clin. Invest. 
94:1351 (1994)). Constmction of recombinant AAV vectors is described in a number of 
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^ ^ Pat No. 5,173,414; Tratschin et aL^^. Cell Biol 5:3251-3260 

(1985); Tratschin, et al, Mol Cell Biol 4:2072-2081 (1984); Hennonat & Muzyczka, PNAS 
81:6466-6470 (1984); and Samulski et al, J. Virol 63:03822-3828 (1989). 

[191] pLASN and MFG-S are examples are retroviral vectors that have been 
used in clinical trials (Dunbar et al. Blood 85:3048-305 (1995); Kohn et al, Nat, Med. 
1:1017-102 (1995); Maleche^a/., PNAS94:22 12133-12138 (1997)), PA317/pLASN was 
the first therapeutic vector used in a gene ttierapy trial. (Blaese et al. Science 270:475-480 
(1995)). Transduction efficiencies of 50% or greater have been observed for MFG-S 
packaged vectors. (Ellem et al, Immunol Immunother. 44(l):10-20 (1997); Dranoff al. 
Hum. Gene Ther. 1:111-2 (1997), 

[192] Recombinant adeno-associated virus vectors (rAAV) are a promising 
alternative gene delivery systems based on the defective and nonpathogenic parvovuiis 
adeno-associated type 2 virus. All vectors are derived from a plasmid that retains only the 
AAV 145 bp inverted terminal repeats flanking the transgene expression cassette. Efficient 
gene transfer and stable transgene delivery due to integration into the genomes of the 
transduced cell are key features for this vector system. (Wagner et al. Lancet 351:9117 1702- 
3 (1998), Keams et al, Gene Ther. 9:748-55 (1996)). 

[193] RepUcation-deficient recombinant adenoviral vectors (Ad) can be 
engineered such that a transgene replaces the Ad Ela, Elb, and E3 genes; subsequently the 
replication defector vector is propagated in human 293 cells that supply deleted gene function 
in trans. Ad vectors can transduce multiply types of tissues in vivo, including nondividing, 
differentiated cells such as those found in the liver, kidney and muscle system tissues. 
Conventional Ad vectors have a large carrymg capacity. An example of tiie use of an Ad 
vector in a clinical trial involved polynucleotide therapy for antitumor immimization with 
intramuscular injection (Sterman et al, Hum. Gene Ther. 7:1083-9 (1998)). Additional 
examples of the use of adenovirus vectors for gene transfer in clinical trials include 
Rosenecker et al. Infection 24:1 5-10 (1996); Sterman et al. Hum. Gene Ther. 9:7 1083- 
1089 (1998); Welsh et al. Hum. Gene Ther. 2:205-18 (1995); Alvarez etal. Hum. Gene 
Ther. 5:597-613 (1997); To^f etal. Gene Ther. 5:507-513 (1998); Sterman etal. Hum. Gene 
772er. 7:1083-1089 (1998). 

[1941 Packaging cells are used to form virus particles that are capable of 
infecting a host cell. Such cells include 293 cells, which package adenoviras, and ypl cells or 
PA317 cells, which package retroviras. Viral vectors used in gene therapy are usually 
generated by producer cell line that packages a nucleic acid vector into a viral particle. The 
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vectors typically contauRhe miniinal viral sequences required iM^ckaging and subsequent 
integration into a host, other viral sequences being replaced by an expression cassette for the 
protein to be expressed. The missing viral functions are supplied in trans by the packaging 





cell line. For example, AAV vectors used in gene therapy typically only possess ITR 
5 sequences from the AAV genome which are required for packaging and integration into the 
host genome. Viral DNA is packaged in a cell line, which contains a helper plasmid 
encoding the other AAV genes, namely rep and cap, but lacking ITR sequences. The cell 
line is also infected with adenovirus as a helper. The helper virus promotes repUcation of the 
AAV vector and expression of AAV genes from the helper plasmid. The helper plasmid is 
1 0 not packaged in significant amounts due to a lack of ITR sequences. Contamination with 

adenovirus can be reduced by, e.g., heat treatment to which adenovirus is more sensitive than 



vector be deUvered with a high degree of specificity to a particular tissue type. A viral vector 
15 is typically modified to have specificity for a given cell type by expressing a ligand as a 

fiision protein with a viral coat protein on the viruses outer surface. The ligand is chosen to 
have affinity for a receptor known to be present on the cell type of interest. For example, 
Han et al.PNAS 92:9747-9751 (1995), reported that Moloney murine leukemia virus can be 
modified to express human heregulin fiised to gp70, and the recombinant virus infects certain 
20 human breast cancer cells expressing human epidermal growth factor receptor. This principle 
can be extended to other pairs of virus expressing a ligand fiision protein and target cell 
expressing a receptor. For example, filamentous phage can be engineered to display antibody 
fragments (e.g., FAB or Fv) having specific binding aflBnity for virtually any chosen cellular 
receptor. Although the above description applies primarily to viral vectors, ttie same 
25 principles can be applied to nonviral vectors. Such vectors can be engineered to contain 
specific uptake sequences thought to favor uptake by specific target cells. 



individual patient, typically by systemic administration (e.g., intravraious, intraperitoneal, 
intramuscular, subdermal, or intracranial infiision) or topical appUcation, as described below. 
30 Alternatively, vectors can be delivered to cells ex vivo, such as cells explanted from an 
individual patient (e.g., lymphocytes, bone marrow aspirates, tissue biopsy) or universal 
donor hematopoietic stem cells, followed by reimplantation of the cells into a patient, usually 
after selection for cells which have incorporated the vector. 



AAV. 



[195] In many gene therapy applications, it is desirable that the gene therapy 



[196] Gtene tilierapy vectors can be delivered in vivo by administration to an 
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[1971 



Ivo cell transfection for diagnostics. 



sh, or for gene therapy 



(e.g., via re-infusion of the transfected cells into the host organism) is well known to those of 
skill in the art. In some embodiments, cells are isolated from the subject organism, 
transfected with a nucleic acid (gene or cDNA) encoding a polypeptides of the invention, and 
re-infused back into the subject organism (e.g., patient). Various cell types suitable for ex 
vivo transfection are well known to those of skill in the art (see, e.g., Freshney et al. Culture 
of Animal Cells, A Manual of Basic Technique (3rd ed. 1994)) and the references cited 
therein for a discussion of how to isolate and culture ceUs from patients). 



transfection and gene therapy. The advantage to using stem cells is that they can be 
differentiated into other cell types in vitro, or can be introduced into a mammal (such as the 
donor of the cells) where they will engraft in the bone marrow. Methods for differentiating 
CD34+ cells in vitro into clinically important immune cell types using cytokines such a GM- 
CSF, IFN-7andTNF-aareknown(5reeInabae^ a/., J. Exp, Med. 176:1693-1702 (1992)). 



known methods. For example, stem cells are isolated from bone marrow cells by panning the 
bone marrow cells with antibodies which bind unwanted cells, such as CD4+ and CD8+ (T 
cells), CD45+ (panB cells), GR-1 (granulocytes), and lad (differentiated antigen presenting 
cells) {see Inaba et al, J. Exp. Med. 176:1693-1702 (1992)). 



therapeutic nucleic acids can be also administered directly to the organism for transduction of 
cells mvzvo. Alternatively, naked DNA can be administered. Administrationisby any of the 
routes normally used for introducing a molecule into ultimate contact with blood or tissue 
cells. Suitable methods of administering such nucleic acids are available and well known to 
those of skill in the art, and, although more than one route can be used to administer a 
particular composition, a particular route can often provide a more immediate and more 
effective reaction than another route. 

[201] Phaxmaceutically acceptable carriers are determined in part by the 
particular composition being administered, as well as by the particular method used to 
administer the composition. Accordingly, there is a wide variety of suitable formulations of 
pharmaceutical compositions of the present invention, as described below {see, e.g.. 
Remington's Pharmaceutical Sciences, 17th ed., 1989). 



[198] In one embodiment, stem cells are used in ex vivo procedures for cell 



[199] Stem cells are isolated for transduction and differentiation using 



[200] Vectors (e.g., retroviruses, adenoviruses, liposomes, etc.) containing 
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lABETES 

[202] The present invention also provides methods of diagnosing diabetes or 
a predisposition of at least some of the pathologies of diabetes. Diagnosis can involve . 
determination of a genotype of an individual (e.g., with SNPs) and comparison of the 
genotype with alleles known to have an association with the occurrence of diabetes. 
Alternatively, diagnosis also involves determining the level of a polypeptide or 
polynucleotide of the invention in a patient and then comparing the level to a baseline or 
range. Typically, the baseline value is representative of a polypeptide or polynucleotide of 
the invention in a healthy (e.g., lean) person. 

[203] As discussed above, variation of levels (e.g., low or high levels) of a 
polypeptide or polynucleotide of the invention compared to the baseline range indicates that 
the patient is either diabetic or at risk of developing at least some of tiie pathologies of 
diabetes (e.g., pre-diabetic). The level of a polypeptide m a lean individual can be a reading 
from a single individual, but is typically a statistically relevant average from a group of lean 
individuals. The level of a polypeptide in a lean individual can be represented by a value, for 
example in a computer program. 

[204] In some embodiments, the level of polypeptide or polynucleotide of 
the invention is measured by taking a blood, urine or tissue sample from a patient and 
measuring the amount of a polypeptide or polynucleotide of the invention in the sample using 
any number of detection methods, such as those discussed herein. For instance, fasting and 
fed blood or urine levels can be tested. 

[205] In some embodiments, the baseline level and the level in a lean sample 
from an individual, or at least two samples from the same individual differ by at least about 
5%, 10%, 20%, 50%, 75%, 100%, 150%, 200%, 300%, 400%, 500%, 1000% or more. In 
some embodiments, the sample from the individual is greater by at least one of the above- 
listed percentages relative to the baseline level. In some embodiments, the sample from the 
individual is lower by at least one of the above-listed percentages relative to the baseline 
level. 

[206] In some embodiments, the level of a polypeptide or polynucleotide of 
the invention is used to monitor the effectiveness of antidiabetic therapies such as 
thiazolidinediones, metformin, sulfonylureas and other standard therapies. In some 
embodiments the activity or expression of a polypeptide or polynucleotide of the invention 
will be measured prior to and after treatment of diabetic or pre-diabetic patients with 
antidiabetic therapies as a surrogate marker of clinical effectiveness. For example, the 
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greater tiie reduction in^^ression or activity of a polypeptide dSR invention indicates 
greater effectiveness, 

[207] Glucose/insulin tolerance tests can also be used to detect the effect of 
glucose levels on levels of a polypeptide or polynucleotide of the invention. In glucose 
tolerance tests, the patient's ability to tolerate a standard oral glucose load is evaluated by 
assessing serum and urine specimens for glucose levels. Blood samples are taken before the 
glucose is ingested, glucose is given by mouth, and blood or virine glucose levels are tested at 
set intervals after glucose ingestion. Similarly, meal tolerance tests can also be used to detect 
the effect of insulin or food, respectively, on levels of a polypeptide or polynucleotide of the 
invention. 

[208] All publications and patent applications cited in this specification are 
herein incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 

[209] Although the foregoing invention has been described in some detail by 
way of illustration and example for purposes of clarity of understanding, it will be readily 
apparent to one of ordinary skill in the art in light of the teachings of this invention that 
certain changes and modifications may be made thereto without dq)arting from the spirit or 
scope of the appended claims. 

EXAMPLES 

[210] The following examples are offered to illustrate, but not to limit the 
claimed invention. 

[211] The molecular mechanism by which fliiazolidmediones (TZDs) cause 
an increase in peripheral insulin sensitivity was studied. Genes in muscle or fat whose 
expression is altered by TZDs may lie on a pathway leading firom TZD treatment to mcreased 
insulin sensitivity. Modulators of such genes can elicit the same effect as TZD treatment. 
Moreover, such modulators can lack some of the side effects of TZD. The majority of 
glucose disposal occurs in muscle. For this reason, gene expression profiling in human 
muscle firom diabetics treated with troglitazone was used to identify genes important for TZD 
action and therefore treatment of diabetes and insulin resistance. 

[212] Gene expression profiling was performed on tissue samples (muscle) 
obtained fi-om lean obese and diabetic individuals. Two studies were performed. In the first 
study, basal samples were isolated firom all individuals at the beginning of a 5 hoxa 
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hyperinsulinemic eugly^iic clamp. Clamp samples were isolaWrat the end of this 
procedure. Similar basal and clamp samples were taken 3 months later after all patients had 
taken the insulin sensitizing drug troglitazone. 

[213] In the second study, samples were obtained from lean, obese and 
diabetic individuals before and after a hyperinsulinemic euglycemic clamp. No troglitazone 
treatment was used. For all tissue samples mKNA was isolated from these muscle samples 
and converted to cRNA by standard procedures. The gene expression profile for each 
individual was determined by hybridization of cRNA to custom synthesized AfiEymetrix 
chips. 

[214] Gene expression profile differences were calculated as follows. The 
expression level of a particular gene is indicated by its ^average difference score'. The raw 
data was analyzed by a statistical test to remove ^outliers'. The mean 'average difference 
score* was then calculated from the average difference scores for all individuals in a 
particular treatment group. Genes were determined to be changed in condition 1 (such as 
basal diabetics before trogliazone (tro) treatment) versus condition 2 (such as basal diabetics 
after troglitazone treatment) by calculating the Students t test statistic between the two 
conditions and selecting those with t less than or equal to 0.05. Fold change was determined 
as the ratio of mean average difference score in condition 2 to the mean average difference 
score in condition 1 . 

[2151 Probe set mbxhummus 24330 detects expression of connective tissue 

growth factor (CTGF) mKNA. 

[216] Expression of transcripts encoding Connective Tissue Growth Factor 
(SEQ ID NO:2) was decreased in tro-treated patients compared to untreated patients in this 
study. 
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sample was taken before or after 3 months of Troglitazone treatment; *TV[ean Expr" indicates 
mean expression; "SEM" indicates standard error of mean; "n" indicates number of patient 
samples; 'Told Change" indicates fold change of diabetic post tro/diabetic pre tro. 
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:2) contains the 



[217] Comective Tissue Growth Factor (SEQ 
following protein domains: 
Signal peptide at amino acids 1 to 27; 

Insulm growth factor-biading protein homologues domain at amino acids 27 to 97; 
Von Willebrand factor (vWF) type C domain at amino acids 103 to 166; 
Thrombospondin type 1 repeat Domain at amino acids 200 to 243; and 
C-terminal cystine knot-like domain at amino acids 261 to 330. 

[21 8] Probe set mbxhurmnus23808 detects expression of TDEG mRNA 
[2191 Expression of transcripts encoding TIEG (SEQ ID NO:8) was higher in 
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[220] Expression of transcripts encoding TIEG (SEQ ID NO:8) was reduced 
in tro-treated patients compared to untreated patients. 
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Legend: B/C indicates sample is from Basal or Clamp; 'Tre-Trog'* and 'Tost-Trog" indicates 
sample was taken before or after 3 months of Troglitazone treatment; "Mean Expr" indicates 
mean expression; "SEM" indicates standard error of mean; "n" indicates number of patient 
samples; 'Told Change" indicates fold change of diabetic post-trog/diabetic pre-trog. 

[221] Transcripts encoding TIEG (SEQ ID NO:8) contain the following 
protein domains: 

zf-C2H2 domain encoded by amino acids 358 to 382 
zf-C2H2 domain encoded by amino acids 388 to 412 
zf-C2H2 domain encoded by amino acids 418 to 440. 

[2221 A splice variant of TIEG is also known (SEQ ID NO :13), which 
encodes the amino acid sequence displayed in SEQ ID NO:14. 

[2231 Transcripts encoding TIEG (SEQ ID NO: 13) contain the following 
protein domains: 

zf-C2H2 domain encoded by amino acids 369 to 393 
zf-C2H2 domain encoded by amino acids 399 to 423 
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zf-C2H2 domain encooia by amino acids 429 to 440. 

[2241 The gene expression changes observed in the human muscle samples 
derived from lean and diabetic individuals or from diabetic individuals post-Troglitazone 
5 treatment, were further confirmed by the use of real-time quantitative PGR. A combination 
of specific PGR primers and a Taqman probe were designed to detect and quantify expression 
levels for each gene. Relative gene expression levels were determined in this manner in each 
patient and the mean expression level in either the lean or diabetic patient population was 
then calculated. 

1 0 [225] In order to demonstrate the ability of the gene to modulate insulin 

sensitivity a number of different analyses were undertaken. One analysis called "Glut 4 
translocation" involves over-expressing the gene in 3T3-L1 adipocytes and then monitoring 
the ability of insulin to cause the movement (translocation) of the major insulin-regulated 
glucose transporter. Glut 4, to the cell surface. The cDNA encoding the hxmaan gene was 

1 5 sub-cloned into the mammalian expression vector pcDNA3 . 1 which provides for a V5 

epitope tag. This plasmid encoding the human gene was then introduced into differentiated 
murine 3T3-L1 adipocytes along with a cloned Glut 4 expression construct using an 
electroporation procedure essentially as described in Kanzaki et al J. BioLChem 275 7167- 
7175 (2000). A plasmid expressing LacZ was used as a negative control. After stimulation 

20 with increasing concentrations of insidin, cells co-expressmg both the human gene and 

cloned Glut 4 were scored for the presence of Glut 4 at the cell surface using fluorescence 
microscopy. 

[226] In some instances another analysis called "glucose transport" was used. 
This involves over-expressing tiie gene in either differentiated 3T3-L1 adipocytes or 

25 differentiated L6 myotubes and then monitoring the ability of insulin to stimulate glucose 
transport into these two cell types. A recombinant adenovirus encoding the human gene 
tagged with a FLAG epitope tag was prepared essentially as described in Zhou et al Proc. 
Nat Acad. Set USA. 95: 2509-2514, 3T3-L1 adipocytes or L6 myotubes were infected with 
recombiaant adenovirus expressing the human gene. A vims expressing an irrelevant protein 

30 was used as the control. 24 h post-infection the cells are stimulated with increasing 

concentrations of insulin and glucose transport into the cells determined by the coimting the 
accumulation of a radiolabelled glucose analog (2-deoxyglucose) essentially as describe in 
Kotani et al Mol Cell Biol 18 6971-6982 (1998), Fujishiro et al J. Biol Chem. 276 19800- 
19806 (2001), Ross et al Biochem. Biophys. Res. Commun. 302 354-358 (2003). 
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Real time PGR analysis 
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Legend •Tro" indicates treatment with Troglitazone. 'Told change' mdicates told cHange m 
CTGF expression calculated as the ratio of mean diabetic post-Tro expression/mean diabetic 
pre-Tro e3q)ression. Numbers in parentheses indicate the number of patient samples analyzed 
by real time PGR. 

[227] These data further show that treatment with Troglitazone reduces the 
expression of CTGF in muscle from diabetic individuals. 
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cell surface Glut 4). "n" is the nvimber of replicates. 

(2281 These data show that increasing the levels of hCTGFin 3T3-L1 
adipocytes significantly inhibited Glut 4 translocation at all insulin concentrations tested with 
little or no effect on basal Glut 4 translocation. 

Glucose transport analysis m 3T3-L1 adipocytes 
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0.7 



0.233 



0.63 +/- 0.06 
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0.7 
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0.67+/- 0.02 
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0.76+/- 0.05 
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0.83 +/- 0.02 
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0.6 



0.015 



0.7 



0.012 



0.9 



0.143 



0.75+/- 0.10 



25 



Legend "Con" indicates control cells which do not express hCTGF. "FC" indicates the fold 
change defined as the following ratio; glucose transport in hCTGF-expressing cells/glucose 
transport in non-hCTGF-expressing cells, h" is human, "n" is die numbo: of replicates. 
is the standard error of the mean. 
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[229] TB^e data show that increasing the level^^PBiCTGF in 3T3-L1 
adipocytes inhibited both basal and insulin stimulated glucose transport 
Glucose transport analysis in L6 myotnbes 
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0.042 
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0.79 +/- 0.004 
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0.163 



0.92 



0.121 



0.77 



0.080 



0.79 +/- 0.07 
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20 



25 



100 



0.70 



0.226 



1.05 



0.404 



0.74 



0.001 



0.83+/- 0.11 



Legend *'Con" indicates control cells which do not express CTGF. *TC" indicates tiie fold 
change defined as the following ratio; glucose transport in CTGF-expressing cells/glucose 
transport in non-CTGF-expressing cells, h" is human, "n" is the number of replicates. SEM 
is the standard error of the mean. 

[230] These data show that increasing the levels of hCTGF in L6 myotubes 
inhibited both basal and insulin stimulated glucose transport. These data are consistent with 
the data obtained in 3T3-L1 adipocytes. 

[231] Decreased CTGF levels such as those observed in diabetic patients 
treated with troglitazone would therefore be beneficial at improving glucose uptake into 
muscle and fat. 
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TDEG 



Real time PCR analysis 



Comparison 



Diabetics + Tro (9) /Diabetics - Tro 

(?) 



Diabetics (19)/Lean (17) 



Expression Fold change 



0.69 



2.56 



ttest 



0.057 



0.005 



Legend "Tro" indicates treatment with Troglitazone. 'Told change" indicates fold change in 
TIEG expression calculated as either the ratio of mean diabetic post-Tro expression/mean 
diabetic pre-Tro expression or mean diabetic pre-Tro expression/mean lean pre-Tro 
expression. Numbers in parentheses indicate the niunber of patient samples analyzed by real 
time PCR. 

[232] These data fiirther show that TIEG is significantly over-expressed in 
muscle firom diabetic individuals when compared to muscle &om lean individuals. 
Fvirthermore the data also shows that treatment with Troglitazone reduces the e3q)ression of 
TIEG in muscle fix>m diabetic individuals. 
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Fold Change (Mean hTIEG/Mean LacZ) (n=3) 



ttest 
(hUEGvsLacZ) 



1.34 



0.600 



0.20 



0.019 



10 



0.11 



5.1E-05 



Legend "Fold Change" indicates the following ratio; (Mean % of hTJEG-expressing cells 
scored positive for cell surface Glut 4)/(Mean % of LacZ-expressing cells scored positive for 
cell surface Glut 4). h" is human, "n" is the number of replicates. 

[233] These data show that increasing the levels of hTIEG in 3T3-L1 
adipocytes significantly inhibited Glut 4 translocation at all insulin concentrations tested with 
little or no effect on basal Glut 4 translocation. Increased TIEG levels such as those observed 
in human diabetic muscle are therefore inhibitory to insulin action. 
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WHAT IS CLAIMED 

1 1 . A method for identifying an agent for treating a diabetic or pre-diabetic 

2 individual, the method comprising the steps of: 

3 (i) contacting an agent to a mixture comprising a polypeptide encoded by 

4 a polynucleotide that hybridizes under stringent conditions to a nucleic acid encoding SEQ 

5 ID N0:2, SEQ ID NO:4, SEQ ID N0:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12 or 

6 SEQ ID NO: 14; and 

7 (ii) selectiQg an agent that modulates the expression or activity of the 

8 polypeptide or that binds to the polypeptide, thereby identifying an agent for treating a 

9 diabetic or pre-diabetic individual. 

1 2. The method of claim 1, the method further comprising selecting an 

2 agent that modulates insulin sensitivity. 

1 3 . The method of claim 1 , wherein step (ii) comprises selecting an agent 

2 that modulates expression of the polypeptide. 

1 4. The method of claim 1, wherein step (ii) comprises selecting an agent 

2 that modulates the activity of the polypeptide. 

1 5. The method of claim 1, wherein step (ii) comprises selecting an agent 

2 that specifically binds to the polypeptide. 

1 6. The method of claim 1 , wherein the polypeptide is expressed in a cell 

2 and the cell is contacted with the agent. 

1 7. A method of treating a diabetic or pre-diabetic animal, the method 

2 comprising administering to the animal a therapeutically effective amount of an agent 

3 identified by the method of claim 1 . 

1 8. The method of claim 7, wherein the agent is an antibody. 

1 9, The method of claim 8, wherein the antibody is a monoclonal 

2 antibody. 

1 10. The method of claim 7, wherein the animal is a human. 
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1 11 . A method of introducing an expression cassette into a cell, the method 

2 comprising, 

3 introducing into the cell an expression cassette comprising a promoter 

4 operably linked to a polynucleotide encoding a polypeptide, wherein the polynucleotide 

5 hybridizes under stringOTt conditions to a nucleic acid encoding SEQ ID NO:2, SEQ ID 

6 NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12 or SEQ ID NO:14. 

1 12. The method of claim 11, wherein the cell is selected from the group 

2 consisting of an adipocyte and a skeletal muscle cell. 

1 13. The method of claim 1 1, the method further comprising introducing 

2 the cell into a human. 

1 14. The method of claim 13, wherein the human is diabetic. 

1 15. The method of claim 13, wherein the hmnan is prediabetic. 

1 16. The method of claim 13, wherem the cell is from the human. 

1 17. A method of diagnosing an individual who has Type 2 diabetes or is 

2 prediabetic, the method comprising, 

3 detecting in a sample from the individual the level of a polypeptide or the level 

4 of a polynucleotide encoding the polypeptide, wherein the polynucleotide hybridizes under 

5 stringent conditions to a nucleic acid encoding an amino acid sequence selected from the 

6 group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID 

7 NO:10, SEQ ID NO:12 or SEQ ID NO: 14, 

8 wherein a modulated level of tiie polypeptide or polynucleotide in the sample 

9 compared to a level of the polypeptide or polynucleotide in either a lean individual or a 
10 previous sample from the individual indicates that the individual is diabetic or prediabetic. 

1 18. The method of claim 17, wherein the detecting step comprises 

2 contacting the sample with an antibody that specifically binds to the polypeptide. 

1 19. The method of claim 17, wherein tiie detecting step comprises 

2 quantifying mKNA encoding the polypeptide. 
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1 20. The method of claim 19, wherein the mRNA is reverse transcribed and 

2 ampUfied in a polymerase chain reaction. 

1 21. The mediod of claim 17, wherein the sample is a blood, urine or tissue 

2 sample. 

1 22. An isolated nucleic acid encoding SEQ ID NO: 14. 

1 23. The isolated nucleic acid of claim 22, wherein the nucleic acid 

2 comprises SEQ ID N0:13. 

1 24. An expression cassette comprising a heterologous promoter operably 

2 linked to a polynucleotide encoding SEQ ID NO: 14 

1 25. The expression cassette of claim 24, wherein the polynucleotide 

2 comprises SEQ ID NO: 1 3 . 

1 26. A host cell comprising the expression cassette of claim 24. 
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SEQUENCE LISTING 
SEQ ID NO:l Human CTGF nucleic acid 

>Nucleotide sequence HUM205074 accession:X78947 CDS : 146 1195 
TCCAGTGACGGAGCCGCCCGGCCGACAGCCCCGAGACGAaVGCCCGGCGCGTCCCGGTCCCCACCTCCGAC 
5 GCCAGCGCTCCAGGCCCCGCGCTCCCCGCTCGCCGCCACCGCGCCCTCCGCTCCGCCCGCAGTGCCAACCATGAC 
CGCCGCCAGTATGGGCCCCGTCCGCGTCGCCTTCGTGGTCCTCCTCGCCCTCTGCAGCCGGCCGGCCGTCGGCCA 
GAACTGCAGCGGGCCGTGCCGGTGCCCGGACGAGCCGGCGCCGCGCTGCCCGGCGGGCGTGAGCCTCGTGCTGGA 
CGGCTGCGGCTGCTGCCGCGTCTGCGCCAAGCAGCTGGGCGAGCTGTGCACCGAGCGCGACCCCTGCGACCCGCA 
CAAGGGCCTCTTCTGTGACTTCGGCTCCCCGGCCAACCGCAAGATCGGCGTGTGCACCGCCAAAG^^ 

10 CTGCATCTTCGGTGGTACGGTGTACCGCAGCGGAGAGTCCTTCCS^GAGCaVGCTGa^GTACCAGTGCACGTGCCT 
GGACGGGGCGGTGGGCTGCATGCCCCTGTGCAGCATGGACGTTCGTCTGCCCAGCCCTGACTGCCCCTTCCCGAG 
GAGGGTCAAGCTGCCCGGGAAATGCTGCGAGGAGTGGGTGTGTGACGAGCCCAAGGACCAAACCGTGGTTGGGCC 
TGCCCTCGCGGCTTACCGACTGGAAGACACGTTTGGCCCAGACCCAACTATGATTAGAGCCAACTGCCTC 
GACCACAGAGTGGAGCGCCTGTTCCAAGACCTGTGGGATGGGCATCTCCACCCGGGTTACCAATGACAACGCCTC 

15 CTGCAGGCTAGAGi\AGCAGAGCCGCCTGTGCy^TGGTCAGGCCTTGCGAAGCTGACCTGGAAGAGAACATTAAGJ^ 
GGGCAAAAAGTGCATCCGTACTCCCAAAATCTCCAAGCCTATCAAGTTTGAGCTTTCTGGCTGCACCAGCATGAA 
GACATACCGAGCTAAATTCTGTGGAGTATGTACCGACGGCCGATGCTGCACCCCCCACAGAACCACCACCCTGCC 
GGTGGAGTTCAAGTGCCCTGACGGCGAGGTCATGAAGAAGAACATGATGTTCATCAAGACCTGTGCCTGCCATTA 
CaUVCTGTCCCGGAGACAATGACATCTTTGAATCGCTGTACTACAGGAAGATGTACGGAGACATGGCATGAAGCCA 

20 GAGAGTGAGAGACATTAACTCATTAGACTGGAACTTGAACTGATTOVCaVTCTCaVTOT 

AGTAGCACAAGTTATTTAAATCTGTTTTTCTAACTGGGGGAAAAGATTCCCACCCa^TTCAAJ^ 
GTCAAACAM^TAGTCTATCTTCCCCAGACACTGGTTTGAAGAATGTTAAGACTTGACAGTGGAACTACaVTTAGTA 
CACAGCACCAGAATGTATATTAAGGTGTGGCTTTAGGAGCAGTGGGAGGGTACCAGCAGAAAGGTTAGTATCATC 
AGATAGCTCTTATACGAGTAATATGCCTGCTATTTGAAGTGTAATTGAGAAGGAAAATTTTAGCGTGCTCACTGA 

25 CCTGCCTGTAGCCCCAGTGACAGCTAGGATGTGCATTCTCCAGCCATCAAGAGACTGAGTC^ 
GTCAGAACAGCAGACTCT^GCTCTGACATTCTGAlTCGAATGACa^CTGTTCyVGGAATCGGA^ 
CTGGACAGCTTGTGGCAAGTGAATTTCCTGTAACAAGCCAGATTTTTTAAAATTTATATTGTAAATATTGTG^^ 
GTGTGTGTGTGTGTATATATATATATATATGTACAGTTATCTAAGTTAATTTAAAGTTGTTTGTGCCTTTTTATT 
TTTGTTTTTAATGCTTTGATATTTCAATGTTAGCCTCAATTTCTGAACACCATAGGTAGAATGTAAAGCTTGTCT 

30 GATCGTTCAAAGCATGAAATGGATACTTATATGGAAATTCTCTCAGATAGAATGACAGTCCGTCAAAACAGATTG 
TTTGCAAAGGGGAGGCATCAGTGTCCTTGGCAGGCTQATTTCTAGGTAGGAAATGTGGTAGCTCACGCTCACTTT 
TAATGAACAAATGGCCTTTATTAAAAACTGAGTGACTCTATATAGCTGATCAGTTTTOTCACCTGGAAGCaVTOT 
TTTCTACTTTGATATGACTGTTTTTCGGACAGTTTATTTGTTGAGAGTGTGACCAAAAGTTACATGTTTGCACCT 
TTCTAGTTGAAAATAAAGTATATTTTTTCTAAAAAAAAAAAAAAACGACAGCAACGGAATTC 

35 

SEQ ID NO:2 Human CTGF polypeptide 
>Protein secjuence protein_id : gi474934 

MTAASMGPWVAFVVLLALCSRPAVGQNCSGPCRCPDEPAPRCPAGVSLVLDGCGCCRVCAKQLGELCTEF^ 
PHKGLFCDFGSPANRKIGVCTAKDGAPCIFGGTVYRSGESFQSSCKYQCTCLDGAVGCMPLCSMDVRLPSPDCPF 
40 PRRVKLPGKCCEEWCDEPKDQTWGPALAAYRLEDTFGPDPTMIRANCLVQTTEWSACSKTCGMGISTRVT^ 
ASCRLEKQSRLCMVRPCEADIiEENIKKGKKCIRTPKISKPIKFELSGCTSMKTYRAKFCGVCTDGRCCTPHRTO^ 
IiPVEFKCPDGEVMKKNMMFIKTCACHYNCPGDNDIFESLYYRKMYGDm 

SEQIDNO:3 

45 MOUSE CTGF HOMOLOG: 

>Nucleoticie sequence accession :M802 63 CDS : 204 1250 

AGACTCAGCCAGATCCACTCCAGCTCCGACCCCAGGAGACCGACCTCCTCCAGACGGCAGCAGCCCCAGCCCAGC 
CGACAACCCCAGACGCCACCGCCTGGAGCGTCCAGACACCAACCTCCGCCCCTGTCCGAATCCAGGCTCCAGCCG 
CGCCTCTCGTCGCCTCTGCACCCTGCTGTGCATCCTCCTACCGCGTCCCGATCATGCTCGCCTCCGTCGCAGGTC 

50 CCATCAGCCTCGCCTTGGTGCTCCTCGCCCTCTGCACCCGGCCTGCTACGGGCCAGGACTGCAGCGCGCAATGTC 
AGTGCGCAGCCGAAGCAGCGCCGCACTGCCCCGCCGGCGTGAGCCTGGTGCTGGACGGCTGCGGCTGCTGCCGCG 
TCTGCGCCAAGCAGCTGGGAGAACTGTQTACGGAGCGTGACCCCTGCGACCCACACAAGGGCCTCTTCTGCGATT 
TCGGCTCCCCCGCCAACCGCAAGATTGGAGTGTGCACTGCCAAAGATGGTGCACCCTGTGTCTTCGGTGGGTCGG 
TGTACCGCAGCGGTGAGTCCTTCCAAAGCAGCTGCAAATACCAATGCACTTGCCTGGATGGGGCCGTGGGCTGCG 

55 TGCCCCTATGCAGCATGGACGTGCGCCTGCCCAGCCCTGACTGCCCCTTCCCGAGAAGGGTCAAGCTGCCTGGGA 
AATGCTGCGAGGAGTGGGTGTGTGACGAGCCCAAGGACCGCACAGCAGTTGGCCCTGCCCTAGCTGCCTACCGAC 
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TGGAAGACACyVTTTGGCCCTGACCCAACTATGATGCGAGCCAACTGCCTGG^BVGACC^ 
GTTCTAAGACCTGTGGAATGGGCATCTCCy^CCCGAGTTACCAATGACAATACCTTCTGCAGACTGGAGi^G 
GCCGCCTCTGCATGGTCAGGCCCTGOSAAGCTGACCTGGAGGAAAACATTAAGAAGGGC^^ 
CACCTAAAATCGCCAAGCCTGTCAAGTTTGAGCTTTCTGGCTGCACCSVGTGTGAAGACATA^ 
5 GCGGGGTGTGCACAGACGGCCGCTGCTGCACACCGCACAGAACCACCACTCTGCCAGTGGAGTTC^^ 

ATGGCGAGATCATGAAAAAGAATATGATGTTCATCAAGACCTGTGCCTGCCATTACAACTGTCCTGGGGACAATG 
ACATCTTTGAGTCCCTGTACTACAGGAAGATGTACGGAGACATGGCGTAAAGCCAGGAAGTAAGGGACACGAACT 
CATTAGACTATAACTTGAACTGAGTTGCATCTCATTTTCTTCTGTAAAAACT^TTAG^ 
ATCTGTGTTTTTAACTACCGTGGGAGGAACTATCCCACCAAAGTGAGAACGl^ATGTC^ 

10 CTGTCAACCTCAGACACTGGTTTCGAGACAGTTTACACTTGACAGTTGTTCATTAGCGCACAGTGCCAGAACGCA 
CACTGAGGTGAGTCTCCTGGAACAGTGGAGATGCCAGGAGAAAGAAAGACAGGTACTAGCTGAGGTTATTTTAAA 
AGCAGCAGTGTGCCTACTTTTTGGAGTGTAACCGGGGAGGGAAATTATAGCATGCTTGCAGACAGACCTGCTCTA 
GCGAGAGCTGAGCATGTGTCCTCCACTAGATGAGGCTGAGTCCAGCTGTTCTTTAAGAACAGCAGTTTCAGCTCT 
GACCATTCTGATTCCaVGTGACUVCTTGTCaVGGAGTaVGAGCCTTGTCTGTTAGACTGGACAGCTTGTGG 

1 5 GTTTGCCTGTAACAAGCCAGATTTTTATTGATATTGTAAATATTGTGGATATATATATAT^ 

TACAGTTATCTAAGTTAATTTAAAGTCATTTGTTTTTGTTTTAAGTGCTTTTGGGATTTTAAACTGATAGCCTC^ 
AACTCCAAACACCATAGGTAGGACACGAAGCTTATCTGTGATTCAAAACAAAGGAGATACTGCAGTGGGAATTGT 
GACCTGAGTGACTCTCTGTCAGAACAAACAAATGCTGTGCAGGTGATAAAGCTATGTATTGGAAGTCAGATTTCT 
AGTAGGAAATGTGGTCAAATCCCTGTTGGTGAACAAATGGCCTTTATTAAGAAATGGCTGGCTCAGGGTAAGGTC 

20 CGATTCCTACCAGGAAGTQCTTGCTGCTTCTTTGATTATGACTGGTTTGGGGTGGGGGGCAGTTTATT^ 
AGTGTGACCAAAAGTTACATGTTTGCACTTTCTAGTTGAAAATAAAGTA^^ 
AAAAA 

SEQ ID N0:4 Mouse CTGF polypeptide 

25 >Protein sequence accession:gil09590 

MLASVAGPISIJUjVLLALCTRPATGQDCSAQCQCAAEAAPHCPAGVSLVLDGCG 

HKGLFCDFGSPANRKIGVCTAKDGAPCVFGGSWRSGESFQSSCKYQCTCLDGAVGCVPLCSMDVRLPSPDCPPP 
RRVKLPGKCCEEWCDEPKDRTAVGPAIJUVYRLEDTFGPDPTMM^^ 

FCRLEKQSRLCMVRPCEADIiEENIKKGKKCIRTPKIAKPVKFELSGCTSVKTYRAKFCGVCTDGRCCTPHRTTTL 
30 PVEFKCPDGEIMKKNMMFIKTCACHYNCPGDNDIFESLYYRKMYGDMA 

SEQEDNOiS 

RAT CTGF HOMOLOG: 

>Nucleotide sequence accession :15M__0 222 66 CDS : 225 1268 

35 CACAGCTCTTCTCTCCAAGAAGACTCAGCCAGACCCACTCCAGCTCCGACCCTAGGAGACCGACCTCCTCCAGAC 
GGCAGCAGCCCCAGCCCAGTGGACAACCCCAGGAGCCACCACCTGGAGCGTCCGGACACCAACCTCCGCCCCGAG 
ACCGAGTCCAGGCTCCGGCCGCGCCCCTCGTCGCCTCTGCACCCCGCTGTGCGTCCTCCTGCCGCGCCCCGACCA 
TGCTCGCCTCCGTCGCGGGTCCCGTTAGCCTCGCCTTGGTGCTCCTCCTCTGCACCCGGCCTGCCACCGGCCAGG 
ACTGCAGCGCGCAGTGTCAGTGCGCAGCTGAAGCGGCGCCGCGCTGCCCCGCCGGCGTGAGCCTGGTGCTGGACG 

40 GCTGCGGCTGCTGCCGCGTCTGCGCCAAGCAGCTGGGAGAACTGTGCACGGAGCGTGATCCCTGCGACCCACACA 
AGGGTCTCTTCTGCGACTTCGGCTCCCCCGCCAACCGCAAGATTGGCGTGTGCACTGCCAAAGATGGTGCACCCT 
GTGTCTTCGGTGGGTCCGTGTACCGCAGCGGCGAGTCCTTCCAAAGCAGTTGCAAATACCAGTGCACTTGCCTGG 
ATGGGGCCGTGGGCTGTGTGCCCCTGTGCAGCATGGACGTGCGCCTGCCCAGCCCTGACTGCCCCTTCCCGAGAA 
GGGTCAAGCTGCCCGGGAAATGCTGTGAGGAGTGGGTGTGTGATGAGCCCAAGGACCGCACAGTGGTTGGCCCTG 

45 CCCTAGCTGCCTACCGACTGGAAGACACATTTGGCCCTGACCCAACTATGATGCGAGCCAACTGCCTGGTCCAGA 
CC!ACAGAGTGGAGCGCCTGTTCTAAGACCTGTGGGATGGGCATCTCCACCCGGGTTACCAATGACAATACCTTCT 
GCAGGCTGGAGAAGCAGAGTCGTCTCTGCATGGTCAGGCCCTGTGAAGCTGACCTAGAGGAAAACATTAAGAAGG 
GCAAAAAGTGCATCCGGACGCCTAAT^TTGCCAAGCCTGTCAAGTTTGAGCTTTCTGGCTGCACCAGTGTGAAGA 
CCTACCGGGCTAAGTTCTGTGGGGTGTGCACGGACGGCCGCTGCTGCACACCGCACAGAACCACCACACTGCCGG 

50 TGGAGTTCAAGTGCCCCGATGGCGAGATCATGAAAAAGAACATGATGTTCATCAAGACCTGTGCCTGCCATTACA 
ACTGTCCCGGGGACAATGACATCTTTGAGTCCTTGTACTACAGGAAGATGTATGGAGACATGGCGTAAAGCCAGG 
GAGTAAGGGACACGAACTCATTTAGACTATAACTTGAACTGAGTTACATCTCATTTTCTTCTGTAAAAAAACAAA 
AAGGATTACAGTAGCACATTAATTTAAATCTGGGTTCCTAACTGCTGTGGGAGAAAACACCCCACCGAAGTGAGA 
ACCGTGTGTCATTGTCATGCAAATAGCCTGTCAATCTCAGACACTGGTTTCGAGACAGTTTAGACTTGACAGTTG 

55 TTCACTAGCGCACAGTGACAGAACGCACACTAAGGTGAGCCTCCTGGAAGAGTGGAGATGCCAGGAGAAAGACAG 
GTACTAGCTGAGGTCATTTTAAAAGCAGCGATATGCCTACTTTTTGGAGTGTGACAGGGGAGGGACATTATAGCT 
TGCTTGCAGACAGACCTGCTCTAGCAAGAGCTGGGTGTGTGTCCTCCACTCGGTGAGGCTGAAGCCAGCTATTCT 
TTCAGTAAGAACAGCAGTTTCAGCGCTGACATTCTGATTCCAGTGACACTGGTCGGGAGTCAGAACCTTGTCTAT 
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GTGAATTTGCCGGTAACAAGCCAGATTTT5W&GATCTTGTAAATATTGTGGA 
TAAATATATATATTTGTACyVGTTATCTAAGTTAATTTAAAGACGTTTGTGCCTAOT^ 
TTGGAATTTTTAAACTGATAGCCTCTVAACTCCAAACACCM^CGATAGGA^^ 

AAAGGAGATACTGCAGTGGAAACTGTAACCTGAGTGACTGTCTGTCAGAACATATGGTACGTAGACGGTAAAGCA 
5 ATGGATCAGAAGTCAGATTTCTAGTAGGAAATGTAAAATCy^CTGTTGGCGAACAT^TGGCCTTTATO 
GCTTGCTCAGGGTAACTGGTCAGATTTCCACGAGGAAGTGTTTGCTGCTTCTT^ 
CAGTTTATTTGTTGAGAGTGTGACCAAAAGTTACATGTTTGCACCTTTC 
TTATAAAAAAAAAAAAAAAA 

10 SEQ ID NO:6 Rat CTGF polypeptide 

>Protein sequence accession :gill560085 

MLASVAGPVSIiALVLLLCTRPATGQDCSAQCQCAAEAAPRCPAGVSLVIiDGCGCCRVCA^ 
KGLFCTFGSPANRKIGVCTAKDGAPCnrFGGSVyRSGESPQSSCKYQCTCIiDGAVGCVPLCSMD 
RVKLPGKCCEEWVCDEPKDRTWGPALAAYIUJEDTFGPDPTMMRANCLVQTTEWSACSKTCGM 
15 CRLEKQSRIiCMVRPCEADLEENIKKGKKCIRTPKIAKPVKFEtiSGCTSVOT 
VEFKCPDGEIMKKNMMPIKTCaVCHYNCPGDiroiFESLYYRKMYGDMA 

SEQ ID NO:7 TIEG Human nucleic acid 

>Nucleotide sequence HUM202520 accession:BC011538 CDS:320 . . 1729 

20 CTCACGCTCTCTTTCCCTGCCGCCTGCCTTTCTTTTTTCCTTTTTTTGTATTGGCGTCTTGGGGCTG^ 

ACGCGCGCTGTCCATTGCAGCTTACATAAAGGCGGGCGCGATTATGCAATTATATTGTTAGCGATATTTCAAGAG 
CAATGGCTCGTTTTCTTAGGATTTCAACACGAAGGCATCATGCATT^ 

TTAATGGAAAATAGGAGAT^TGGAGGAAAGAATGGAAATGATTTCTGAAAGGCCAAAAGAGAGTATGTATTCCT^ 
25 GAACAAT^ACTGCAGAGAAAAGTGATTTTGAAGCTGTAGAAGCACTTATGTCAATGAGCTGCS^^ 

TTTTAAGAAATACGTTGAAAACAGACCTGTTACACCAGTATCTGATTTGTCAGAGGAAGAGAATCTGCTTCCGGG 
AACACCTGATTTTCATACAATCCCAGCATTTTGTTTGACTCCACCTTACAGTCCTTCTGACTTTGAACCCTCTCA 
AGTGTCAAATCTGATGGCACCAGCGCCATCTACTGTACACTTCAAGTCACTCTCAGATACTGCCAAACCTCACAT 
TGCCGCACCTTTCAAAGAGGAAGAAAAGAGCCCAGTATCTGCCCCCAAACTCCCCAAAGC 

30 GATTCGTCATACAGCTGATGCCCAGCTATGTAACC».CCAGACCTGCCa^TGAAAG<^GCCAG^^ 

TCAGAACAATTCTTTTAGAAGAAGAACCCACCTAAATGTTGAGGCTGCAAGAAAGAACATACCATGTGC^ 
GTCACCAAACAGATCCAAATGTGAGAGAAACACAGTGGCAGATGTTGATGAGAAAGCAAGTGCTGCACTTTATGA 
CTTTTCTGTGCCTTCCTCAGAGACGGTCATCTGCAGGTCTCAGCCAGCCCCTGTGTCCCCACAACAGAAGTCAGT 
GTTGGTCTCTCCACCTGCAGTATCTGCAGGGGGAGTGCCACCTATGCCGGTCATCTGCCAGATGGTTCCCCTTCC 

35 TGCCAACAACCCTGTTGTGACAACAGTCGTTCCCAGCACTCCTCCCAGCCAGCCACC^^ 

TGTGTTCATGGGCACACAAGTCCCCAAAGGCGCTGTCATGTTTGTGGTACCCCAGCCCGTTG 
GCCTCCGGTGGTGAGCCCGAATGGCACCAGACTCTCTCCCATTGCCCCTGCTCCTGGGTTTTCCCCTTCAGCAGC 
AAAAGTCACTCCTCAGATTGATTCATCAAGGATAAGGAGTCACATCTGTAGCCACCCAGGATGTGGCAAGACATA 
CTTTAAAAGTTCCCATCTGAAGGCCCACACGAGGACGCACACAGGAGAAAAGCCTTTCAGCTGTAGCTGGAAAGG 

40 TTGTGAAAGGAGGTTTGCCCGTTCTGATGAACTGTCCAGACU^CAGGCGAACCCACACGGGTGAGAAGAAATTTGC 
GTGCCCCATGTGTGACCGGCGGTTCATGAGGAGTGACCATTTCACCAAGCATGCCCGGCGCCATCTATC^ 
GAAGCTACCAAACTGGCAGATGGAAGTGAGCAAGCTAAATGACATTGCTCTACCTCCT^CCCCTGCTC 
GTGACAGACCGGAAAGTGAAGAGTCAGAACTAACTTTGGTCTCAGCGGGAGCCAGTGGTGATGTAAAAATGCTTC 
CACTGCAAGTCTGTGGCCCCACAACGTGGGCTTAAAGCAGAAGCCCCACAGCCTGGCACGAAGGCCCCGTCTGGG 

45 TTAGGTGACTAAAAGGGCTTCGGCCy^Cy^GGCAGGTCACAGAAAGGCAGGTTTCATTTCTTATCACATAAGA^^ 
TGAGAAAGCTTTTATTCCTTTGAATATTTTTTGAAGGTTTCaVGATQAGGTCy^a^ 
TCTGTGTGCATATTTGTTACTTTACTTTTGCTGTTTATACTTGAGACCAACTTTTCAATGT^ 
ACTGGTTTCAAGAATATGGAGGCTGGAAGGAAATAAACATTACGGTACAGACATGGAGATGTAAAA 
ATTATTACAAATATTGTCATCTTTTTCTAGAGTTATCTTCTTTATTATTCCTAGTCTTTCCAGTCAACATCGTGG 

50 ATGTAGTGATTAAATATATCTAGAACTATCyVTTTTTACACTATTGTGAATATTTGGAATTGAACGACTGTATA 
GCTAAGAGGGCCCAAAGAATTGGAATCCTCCTTAATTTAATTGCTTTGAAGCATAGCTACAATTTGTTTT 
TTTTGTTTTGAAAGTTTAACAAATGACTGTATCTAGGCATTTCATTATGCTTTGAACTTTAGT^ 
TCTTGTGTAGATTTGAAAATTGTATACCAATGTGTTTTCTGTAGACTCTAAGATACACTGCACTTTGTTTAGAAA 
AAAAACTGAAGATGAAATATATATTGTAAAGAAGGGATATTAAGAATCTTAGATAACTTCTT6AAAAAGATGGCT 

55 TATGTCATCAGTAAAGTACCTTTATGTTATGAGGATATAATGTGTGCTTTATTGAATTAGAAAATTAGTGACCAT 
TATTCACAGGTGGACAAATGTTGTCCTGTTAATTTATAGGAGTTTTTTGGGGATGTGGAGGTAGTTGGGTAGAAA 
AATTATTAGAACATTCACTTTTGTTAACAGTATTTCTCTTTTATTCTGTTATATAGTGGATC 
CAAAACa^AAAGTACyVTTGCTTAAT^TATATAGTGAAAAATGTCACTATATCTT^ 
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TATTGGGTGTAGATTTCTSICATCAAAACTTGGACCCTTGGAAAACAAAAQij^ 

ACTTACAATTTGCACAATATTTCTTTTGTTGTACTTTATATCTTGTTTACy^TA^ 

AAAAAAAAAA 



5 SEQ ID NO:8 Human TIEG polypeptide 

>Protein sequence protein_id: gil50793 89 
MEERMEMISERPKESiyrySWNKTAEKSDFEAVEALMSMSCSWKSDFKKYV^ 

IPAFCLTPPYSPSDPEPSQVSNLMAPAPSTVHFKSLSDTAKPHIAAPFKEEEKSPVSAPKLPKAQATSVIROT 
AQLCaraQTCPMKAASILNYQNNSFimRTHIjK^^ 
10 ETVICRSQPAPVSPQQKSVLVSPPAVSAGGVPPMPVICQMVPLPANNPVVTTVVPSTPPSQPPAVCPPVVFMGTQ 
VPKGAVMFVVPQPWQSSKPPWSPNGTRLSPIAPAPGFSPSAAKVTPQIDSSRIRSHICSHPGCGKTYF^ 
KAHTRTHTGEKPFSCSWKGCERRFARSDELSRHRRTHTGEKKFACPMCTRRFiyD^ 
MEVSKIiNDIALPPTPAPTQ 

15 SEQ ID NO:9 

TIEG MOUSE HOMOLOG: 

>Nucleotide sequence accession :NM_013692 CDS : 114 .. 1553 

CAGACAGCGCGGAGCGCGGTGGCGTCGACGTCTAGTGTCTCAGTGCTCCCGTCTGTGGCTAACTAAGCAGCCAGC 

AGCCAGGCy^GCTCGCGACCTGCGGCCAGGCAGCCAACCATGCTCAACTTCGGCGCTTCTCTCCAGCAAGCTTCGG 

20 AGGGGAAAATGGAACTAATTTCTGAAAAGCCCAGAGAGGGGATGCATCCCTGGGACAAAGCTGAGCy^C^ 

TTGAAGCGGTGGAAGCGCTCATGTCCATGAGCTGCGACTGGAAGTCTCATTTCaU^GAAATACCTTGAAAAC^ 
CTGTCACACCAGTGTCTGATACCTCCGAGGATGACAGCTTGCTTCCAGGGACGCCTGACCTTCAGACAGTCCCAG 
CATTTTGTTTAACGCCACCTTACAGCCCCTCTGACTTCGAACCCTCCCAAGGGTCAAATCTGACTGCATCAGCGC 
CATCTACTGGCCACTTCAAATCTTTCTCCGATGCTGCCAAGCCTCCAGGCGCCACTCCTTTCAAAGAGGAGGAAA 

25 AGAATCCTTTAGCTGCCCCTCCTCTTCCTAAGGCTCAAGCCACCAGTGTCATCOSTCACyiCAGCTGATGCCC^ 
TGTGCAACCACCAGTCCTGCCCCGTGAAAGCAGCTAGCATCCTCAACTATCAGGACU^ATTCTTT 
CCCACGGAAATGTTGAGGCTACTCGAAAGAACATACCCTGTQCTGCAGTGTCACCAAACAGATCCAAGCCTGAGC 
CCAGCACAGTGTCCGATGGTGATGAGAAGGCGGGCGCTGCACTATATGACTTTGCTGTGCCTTCCTCAGAGACAG 
TAATTTGTAGGTCTCAGCCAGCTCCTTCGTCCCCAGTGCAGAAGTCAGTACTGGTGTCTTCACCTACAGTATCCA 

30 CTGGGGGAGTGCCACCCCTGCCTGTCATCTGCCAGATGGTTCCCCTTCCTGCCAACAACTCTCTTGTTAGCA 
TTGTCCCCAGCACTCCTCCTAGCCAGCCACCAGCTGTCTGCTCACCTGTGTTGTTCATGGGC^ 
AGGGCACCGTCGTGTTTGTGGTACCCCAGCCCGTTGTGCAGAGCCCAAGGCCTCCAGTGGTGAGCCCaVGTGGCA 
CCAGACTGTCTCCCATTGCCCCTGCTCCTGGATTCTCTCCTTCAGCAGCAAGGGTCACTCCTCAGATTGACTCGT 
CCAGAGTAAGAAGTCACATCTGTAGCCACCCAGGGTGTGGCAAGACTTACTTTAAAAGTTCCCATCTGAAGGCCC 

35 ACGTGAGGACACACACAGGGGAAAAACGTTTCAGCTGCAGCTGGAAAGGCTGTGAAAGGAGGTTTGCTCGCTCCG 
ATGAACTGTCCAGACACCGGCGGACACACACAGGTGAGAAGAAGTTTGCCTGTCCCATGTGTGACCGTCGGTTTA 
TGAGGAGCGACCATTTAACCAAGCATGCCCGACGCCACCTATCAGCCAAGAAGCTGCCAAACTGGCAAATGGAAG 
TTAGCAAGTTAAATGACATTGCTCTGCCTCCGACCCCTGCTTCCGCACAGTGACGGCCAGAAGATGGAGACGCAG 
AATAAACTTTGGTTAGAGTCAGGAGCCAGTGATGGTGTCAAGTGCTTCTGCAAGGCTGTGGCCCTCCAAAAGGGC 

40 CTAAAGTAGAAGCCCTGGCCTGGGGGAGGCCCCGCCTGGGTGAAATGACAAGAAGTGCTTCAGCCACAGGCAGGT 
CACAGAGGACAGGGCTCT^GTTCTTACCy^CAGAGAGAGAGGAGAACCCTTTTATTCCTCCCTTATTTTAGTCTGG 
AAGTTTCGGCTGAGGTGAGCGCAGCACAGGTTTTGAATCACATACACATTGGGGACTTTGTTTTTGCCATTTATA 
CTTGAGACCAGCTTTGCAGTGTGATTCTTTCAAAGGATTGGTTTCAAGAATATAGAGGCTGGAAATTACGGTACA 
GAAATGGAGCTAGAAAATGAGTTTGTGTTACACAGAGATGTCATCTTCTCCTAGAGTTATCTTGTTTCTTATTCC 

45 TAGTCTTTCCAGTCAAATCCGTGGATGTAGCTAAGTATATCTAAAACTCATTTTTCCACTATTGTTGGTATTTGA 
AGTTGAACAGCTGTACATTGTTGTGGGGGAGCCAAAGGATTGGAACCCTCATTAATTTAATTGCTTGGAAATGCA 
GCTAAAATTCTTCTTTGGCATTTTGTTTTGAAAGTTTAGGCy^TTTTACTCTACrrTTAGATT^ 
GTTTTTTGTGTAGATTTGAAAATTGTATACCAATGTGTTTTCTGTAGGCTTAAAATACACTCCACTO 
AAAAAATCTGGAGATGAAAATATGTATTATAAAGAAGAGATGTCAAGAATTTGAGATAACTCCTTGAGAAAGTTG 

50 GCTTTATGTCATCAGCAAAGGACACTTAACGTCAAGCATACACTGTGGTTTTTTTGTTTTTTTGT^ 

CAAATTAGAAAGTTTAATGACCGTTACAGATGGACAGTGTCTTTTTATTTATAGGAGTTTTTCAGGATGTCAGAG 

TAGATAGGTAGGAAAATTGTTATTAGAACATTCGCTTCTACCTTGAAAAGGATGTTAATGTGGTCATO 

CACCACAGTGTCTGGGCATCTGGGA?UVCTCCGAGACTTTTTTAAAGTGTCATGATGTGAT 

GGGCATCGAATCCAGGGCCTTGCATGTCTTCTGTAAGAGCTCTCTVTCGCTGACCTGTATCCCCCGCAAGAGC^ 
55 GACTTTTGCTAACAGTATTTCTTTTCTGTTGTAAAGTGGACAGATGATACACTTGGTCGCAAAGGTAAA 

AAAATCCACAGTGAAAACCTCACCACACTTTCCCATTTAAACTATTTCCATATCTCAGAGGTTTCTGACATGCAA 

ACTTGAACCCTTGAAAGAAGAGTTTTCTTAAAAATTATAAAAAATCACGAGTTAC;^ 

TTGAACTTTATACCTTGTTTACAATAAAGACTTTTCTTTGGTATA 
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SEQ ID NO:10 TIEG Mouse polypeptide 
>Protein sequence accession :gi73 05571 

^^^FGASLQQASEGKMELISEKPREGMHPWDKAEQSDFEAVEALMSMSCDWKSHFKKy^ 
5 LLPGTPDLQTVPAFCLTPPYSPSDFEPSQGSNIiTASAPSTGHFKSFSDAAKPPGATPFKEEEKNPLAAPPLPI^ 
ATSVIimTADAQLCNHQSCPVKAASIIJbJYQDNSFRRRTHGNVEATR^ 

ALYDFAVPSSETVICRSQPAPSSPVQKSVLVSSPWSTGGVPPLPVICQMVPLPAKnSTSLVSTW 
CSPVIjFMGTQVPEGTVVFVVPQPWQSPRPPWSPSGTIU.SPIAPAPGFSPSAARWPQIDSS 
GKTYFKSSHLKAHWTHTGEKPFSCSWKGCERRPARSDELSRHWiTHTGBKKFACP^^^ 
1 0 LSAKKIiPNWQMEVSKLNDIALPPTPAS AQ 



SEQ ID NO: 1 1 TIEG Rat polynucleotide 

RAT HOMOLOG: 

>Nucleotide sequence accession :NM_03 113 5 CDS : 316 1758 

15 GGAGGGAACACGGTGCCTCGGGTTGTGTACACGCTCCACTGACAGAGCTTCTTGCAGCCGGGCAGCCGCTGATCA 
CGCGTGGCCCCGCCAGCCCATTGGCTGAGGCCTCACACACCTTTGCCGCTGATTGGTCGATACCACGCCCCGCCT 
TCCACCCCGCCCGCGGCGCGGGGCAGGCAGAGCGGCTACCTGCATGGGGAGGGGGGGCAGACAGCGCGGAGCGCG 
GTGGCGTCGACGTCTAGTGTCTCAGTGCTCCGTCTGTGGCTAGGCAGCCAGCAGCCAAGCAGCTCGCGACCTACG 
GCCAGGCAGCCAACCATGCTTAACTTCGGCGCTTCTCTCCAGCAAGCTTCGGAGGGGAAAATGGAACT^^ 

20 GAAAAGTCCAAAGAGGGAGCACATCCCTGGGACaVAAGCTGAACAGAGTGACTTTCAAGCTGTGG^ 

TCCATGAGCTGCGACTGGAAGTCTCACTTCAAGAAATACCTTGAAAACAGGCCTGTCACACCAGTGTCTGATACC 
TCAGAGGAAGACAGTTTGCTTCCAGGGACGCCTGACCTTCAGACCGTCCCAGCATTCTGTTTGACTCCACCTTAC 
AGCCCCTCTGACTTCGAACCCTCGCa^GGGGTCAAATCTGACGGCACCAGCGCCACCTACTGGTCACTTCAG 
CTCTCCGATGCTGCCAAGCCTCCCAGCATCGCTCCTTTTAAAGAGGAGGAAAAGAGCCCy^TTAGCTGCCCCTCCC 

25 CTCCCTAAGGCTCAAGCCACCAGTGTCATCCGTCACACAGCTGATGCCCAACTCTGTAACCa^CC^^ 

GTGAAAGCAGCTAGCATCCTCAACTACCAGGACAATTCTTTCCGGAGAAGAACCCACATAAATGTCGAGGCTACT 
CGAAAGAACATACCCTGT6CTGCTGTGTCACCAAACAGACCCAAGCCTGAGCCCAGCACAGCGGCGAATGGCGCT 
GAGAAGGCCGGCACTGCGCCGTATGACTTTGCTGTGCCTTCCTCAGAGACAGTCATCTGTAGGTCTTCTCAGCCA 
GCTCCCACGTCCCCAGTACAGAAGTCAGTACTGATGTCTTCACCTACAGTGTCCACTGGGGGAGTGCCACCCTTG 

30 CCTGTCATCTGCCAGATGGTTCCCCTTCCAGCAAACAACTCTCTCGTTACCACAGTTGTCCCCAGCAGTCCGCCC 
AGCCAGCCACCCGCTGTCTGCTCACCCGTGTTGTTCATGGGCACTCAGGTGCCCAAGGGCACGGTCATGTTTGTG 
GTACCCCAGCCCGTTGTACAGAGCCCAAAGCCTCCAGTGGTGAGCCCCAATGGCACCAGACTGTCTCCCATTGCC 
CCTGCGCCTGGATTCTCTCCTTCAGCAGCAAGGGTTACTCCCCAGATTGATTCGTCCAGAGTAAGAAGTCACATC 
TGTAGCCACCCyVGGATGTGGCT^GACTTACTTTAAAAGTTCCCATCTGAAGGCCCACGTGAGAACACACA^ 

35 GAAAAACCTTTCAGCTGCAGCTGGAT^GGCTGTGAAAGGAGGTTTGCTCGTTCCGATGAACTGTCCy^GACACCGG 
CGGACACACACAGGTGAGAAGAAATTTGCCTGTCCCATGTGTGACCGTCGGTTCa^TGAGAAGTGACCATTTAACC 
AAACATGCCCGGCGACACCTATCAGCCAAGAAGCTGCCAAACTGGCAAATGGAAGTGAGCAAGTTAAATGACATC 
GCTCTGCCTCCGGCCACAGCGTCGGCACAGTGACGGCCAGAAGATGGAGACTCAGAATAAACTTTGGTCAGAGTC 
GAGCCGGTGACCGTGTCAAATGCTTCTGCAAGTCTGCGGCCCTCCAACAGGGCCTAAAGCAGAAGCCCCTGGCCT 

40 GGGGGGAGGCCCAGCCTGGGTTAGATGACGAGAAGTGCTTCAGCCACAGGCAGTTCACAGAAGGGCGGGGTTCAT 
TTCTTACCACGTAGAGAGATGAGAACACTCGTGTCCCTTGAGTAGTTTTGGAAGGTTTCGGATGAGATGAGCACA 
GCACAGGTTTTGAATCACATACACATCAGGTACTTTGTTTTTGCCATTTATACTTGAGACCAGCTTTGCAGTGTG 
ATTCTTTTAAAGCGTTGGTTTCGAGAATATAGAGGCTGAAATTACGGTACAGAAATGCAGCTAGAAAGTGAGTTT 
GTGTCATCTTCTCCTAGAGTTATCTTGCTTATTATTCCTAGTCTTTACAGTCAACTCCGTGGACGTGGCTAAGTA 

45 TAGAAAACTATCATTTCTTCCT^CTGTTGTTGGTATTTGAGGTTGAACAGC^ 

GATTGGAACTCTCATTAATTTAATTGCTTGGAAGTGCU^GCTAAAGTTCTTTTTTT^ 
TTAGGCATTTTACTCTACCTTAGATATTAGTTTGCTTGCAGTTTTTTGTGTAGATTTGT^AAACTGTAOV 
TGTTTCCTATAGACTTAAGATACGCTGCGCTTTGTTTAGAAAACAATCTGGCAGTGAAAATATGTGTCGTAAAGA 
AGAGATGTCAAGAATTTGAGATGACTCCTTGAGAAAGATGGCTTATGTCATCAGCAAAGGACACTTACGTCAGAC 
50 GTACACTGTTTCTATCAAATCAGGAAGTTAGTGACCATCACAGATGGACAGGTGTCCTTTTATTTATAGGAGTTT 
GTCAGGATGTCTGAGTAGATAGGTAGGAAATTGTTACTAGAACATTCACTTCCGACTTGACAAGGATATTCATGG 
TGCAGATACGTTCTTCCTGCCGCTGCAGTGTCTGGGTGTCCGGGAAACTCCGAGACTTTAAAGTGTCATAATGTG 
GTCACACCTGCAGTTCTGGGGATCCAGAGCCTTGCTTGTGTTCTGTAAGCTCTGTCACCGAGTTGTATCCCCACC 
TGAAAGCAGTGACTTTTGCTAACAGTATTTCTGTTCTGTTGTAAAGTGGATAGAAGACACACTTGGTAGTAAAGG 
55 TAAATTATTCAAAGTCCACAGTGAAT^CTCACCATACCTTCCCATTTAAACCGTTTCTGTATCTCATAG^ 
TGACATGCAAACTTGAACCCTTGAAAAAAAGAGTTTTCTT 



wo 03/102161 PCT/US03/17725 

6/7 



10 



# 

SEQ ID NO:12 Rat polypeptide 




>Protein secjuence accession:gil3592117 

^^JNFGASLQQASEGKMELISEKSKEGAHPWDKAEQSDFEAVEiU.MSMSaDWKSHFK^ 

LLPGTPDLQTVPAFCLTPPYSPSDFEPSQGSNLTAPAPPTGHFRSLSDAAKPPSIAPFKEEEKSPIiAAPPLPKAQ 

ATSVIRHTADAQLaraQSCPVKAASILNYQDNSFRRRTHINVEAT^ 

APYDFAVPSSEWICRSSQPAPTSPVQKSVLMSSPTVSTGGVPPLPVICQMVPLPAOT^ 

VCSPVIiFMGTQVPKGTVMFVVPQPWQSPKPPWSPNGTRLSPIAPAPGFSPSAARVTPQIDSSRVRSH 

CGKTYFKSSHLKAHVRTHTGEKPPSCSWKGCERRFARSDEIiSRHRRTHTGEK^ 

HLSAKKLPNWQMEVSKLNDIALPPATASAQ 



SEQ ID NO:13 Human TIEG splice variant 



>Nucleotide sequence accession :U21847 CDS:87..1529 

GAATTCGGCACGAGCGCCCGTCTGTGGCCAAGCAGCCAGCAGCCTAGCAGCCAGTCAGCTTGCCGCCGGCGGCCA 

AGCAGCCAACCATGCTCAACTTCGGTQCCTCTCTCCAGCAGACTGCGGAGGAAAGAATGGAAATGAT^ 

1 5 GGCCAAAAGAGAGTATGTATTCCTGGAACAAAACTGCAGAGAAAAGTGATTTTGAAGCTGTAGAAGCACTTATGT 
CAATGAGCTGCAGTTGGAAGTCTGATTTTAAGAAATACGTTGAAAACAGACCTGTTACACCAGTATCTGATTTC 
CAGAGGAAGAGAATCTGCTTCCGGGAACACCTGATTTTCATACAATCCCAGCATTTTGTOT 
GTCCTTCTGACTTTGAACCCTCTC7^TGTCAA?VTCTGATGGCACCAGCGCCATCTACTGTAC^^ 
TCTCAGATACTGCCAAACCTCACATTGCCGCACCTTTCAAAGAGGAAGAAAAGAGCCCAGTATCTGCCCCCAAAC 

20 TCCCCAAAGCTCAGGCAACAAGTGTGATTCGTCATACAGCTGATGCCCAGCTATGTAACCyVCCAGACCTGCCC^ 
TGAAAGCAGCCAGCATCCTCAACTATCAGAACAATTCTTTTAGAAGAAGAACCCACCTAAATGTTGAGGCTGCA^ 
GAAAGAACATACCATGTGCCGCTGTGTCACCAAACAGATCCAAATGTGAGAGA?\ACACAGTGGCAGATGTTGATG 
AGAAAGC^^GTGCTGCyVCTTTATGACTTTTCTGTGCCTTCCTCAGAGACGGTCATCTGCAGGTCTCAGCCAGCCC 
CTGTGTCCCCACAACAGAAGTO^GTGTTGGTCTCTCCACCTGCAGTATCTGCyVGGGGGAGTGCCAC 

25 TCATCTGCCAGATGGTTCCCCTTCCTGCCAACAACCCTGTTGTGACAACAGTCGTTCCCAGCACTCCTCCCAGCC 
AGCCACCAGCCGTTTGCCCCCCTGTTGTGTTCATGGGCACACAAGTCCCCy^AAGGCGCTGTCATGTTTGTGGTAC 
CCCAGCCCGTTGTGCAGAGTTCAAAGCCTCCGGTGGTGAGCCCGAATGGCACCAGACTCTCTCCCATTGCCCCTG 
CTCCTGGGTTTTCCCCTTCAGCAGCAAAAGTCACTCCTCAGATTGATTCATCAAGGATAAGGAGTCACATCTGTA 
GCCACCCAGGATGTGGCAAGACATACTTTAAAAGTTCCCATCTGAAGGCCCACACGAGGACGCACACAGGAGAAA 

30 AGCCTTTCAGCTGTAGCTGGAAAGGTTGTGAAAGGAGGTTTGCCCGTTCTGATGAACTGTCCAGACACAGGCGAA 
CCCACACGGGTGAGAAGAAATTTGCGTGCCCCATGTGTGACCGGCGGTTCATGAGGAGTGACCaiTTTGACCAAGC 
ATGCCCGGCGCCATCTATCAGCCAAGAAGCTACCAAACTGGCAGATGGAAGTGAGCAAGCTAAATGACATTGCTC 
TACCTCCAACCCCTGCTCCCACACAGTGACAGACCGGAAAGTGAAGAGTCAGAACTAACTTTGGTCTCAGCGGGA 
GCCAGTGGTGATGTAAAAATGCTTCCACTGCAAGTCTGTGGCCCCACAACGTGGGCTTAAAGCAGAAGCCCCACA 

35 GCCTGGCACGAAGGCCCCGCCTGGGTTAGGTGACTAAAAGGGCTTCGGCCACAGGCAGGTCACAGAAAGGCAGGT 
ipTCATTTCTTATCACATAAGAGAGATGAGAAAGCTTTTATTCCTTTGAATATTTTOT 

CAACACAGGTAGCACAGATTTTGAATCTGTGTGCATATTTGTTACTTTACTTTTGCTGTTTATACTTGAG^^ 

CTTTTCAATGTGATTCTTCTAAAGCACTGGTTTCAAGAATATGGAAGCTGGAAGGAAATAAACATTACGGTACAG 

ACATGGAGATGTAAAATGAGTTTGTATTATTACAAATATTGTCATCTTTTTCTAGAGTTATCTTCTTTATTAT^ 

40 CTAGTCTTTCOVGTCaVACATCGTGGATGTAGTOATTAAATATATCTAGAACTATC::^ 

ATTTGGAATTGAACGACTGTATATTGCTAAGAGGGCCCAAAGAATTGGAATCCTCCTTAATTTAATTGCT^ 
GCATAGCTACAATTTGTTTTTGCATTTTTGTTTTGAAAGTTTAACAAATGACTGTATCTAGGCATTTCATTATO 
TTTGAACTTTAGTTTGCCTGCAGTTTCTTGTGTAGATTTGAAAATTGTATACCyy\TGTGTTTTCTGTAGACTC 
AGATACa^CTGCS^CTTTGTTTAGAAAAAAAACTGAAGATGAAATATATATTGTAAAGAAGGGATATTAAGAATCTT 

45 AGATAACTTCTTGAAAAAGATGGCTTATGTCATCAGTAAAGTACCTTTATGTTATGAGGATATAATGTGTGCTTT 
ATTGAATTAGAAAATTAGTGACCATTATTCACAGGTGGACAAATGTTCGTCCTGTTAATTTATAGGAG^^ 
GGGATGTGGAGGTAGTTGGGTAGAAAAATTATTAGAACATTCACTTTTGTTT^C^GTATTTCTCTTTTATTCT^ 
TATATAGTGGATGATATACACy^GTGGCAAAACAAAAGTACATTGCTTAAAATATATAGTGAAAAATGTCAC^ 
TCTTCCCATTTAACATTGTTTTTGTATATTGGGTGTAGATTTCTGACATCAAAACTTGGACCCTTGGAAAACA^ 

50 AGTTTTAATTAAAAAAAATCCTTGTGACTTACAATTTGCACAATATTTCTTTTGTTGTACTTTATATCTTGOT 
CAATAAAGAATTCCCTTTGQCA 

SEQ ID NO: 14 Polypeptide encoded by human TIEG splice variant 

>Protein sec[uence protein_id:gill55215 
55 MLNFGASIiQQTAEERMEMISERPKESMYSWNKTAEKSDFEAVEAIiMSMSCSWKSDFK^^ 

PVTPVSDLSEBENIiLPGTPDPirriPAFCLTPPYSPSDFEPSQVSNIiMAPAPSTVHFKSLSDTAKP^ 

KSPVSAPKIjPKAQATSVIimTiU:>AQLaraQTCPMKAASIIiNYQ]^ 
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ssetvicrsqpapvspqqksvlvsppavsotP^p 



RJSrWADVDEKASAALYDFS^SSETVICRSQPAPVSPQQKSVLVSPPAVSOT 
VVPSTPPSQPPAVCPPVVFMGTQVPKGAVMFSA^QPWQSSKPPWSPNGTRL 
SRIRSHICSHPGCGKTYFKSSHLKAHTRTHTGEKPFSCSWKGCERRFARSDELSRHRRTHTG 
MRSDHLTKHARRHLSAKECLPlTOQMEVSKLNDIALPPTPi^ 
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